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Tlie present invention provides improved adenovirus vectors and packaging cell lines. One type of improved adenoviral vector 
comprises deletions within the E2b region of the adenoviral genome. These E2b^deieted virus are used in conjunction with novel cell 
lines that consiituiively express E2b gene products. The present invention further provides adenoviral vectors deleted for all viral coding 
regions. These "gutted*' vectors pemiit the transfer of large genes to ceils as demonstrated herein by the transfer of the dystrophin gene to 
the muscle of mice. The E2b-deleted vectors and the gutted vectors provide improved adenoviral vectors useful for a wide variety of eene 
therapy applications. 
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Wp PCT/US97/19541 
IMPROVED ADENOVIRUS VECTORS 

FIELD OF THE INVENTION 

The invention relates to improved adenovirus vectors, and more specifically, 
adenovirus vectors useful for gene therapy. 

BACKGROUND 

Adenoviruses (Ad) are double-stranded DNA viruses. The genome of adenoviruses 
(-36 kb) is complex and contains over 50 open reading frames (ORFs). These ORFs are 
overlapping and genes encoding one protein are often embedded within genes coding for other 
Ad proteins. Expression of Ad genes is divided into an early and a late phase. Earlv genes 
are those transcribed prior to replication of the genome while late genes are transcribed after 
replication. The early genes comprise Ela. Elb. E2a. E2b. E3 and E4. The Ela gene 
products are involved in transcriptional regulation: the Elb gene products are involved in the 
shut-off of host cell functions and mRNA transport. E2a encodes the a DNA-binding protem 
(DBP): E2b encodes the viral DNA polymerase and preterminal protein (pTP). The E3 gene 
products are not essential for viral growth in cell culture. The E4 region encodes regulatory 
protem mvolved in transcriptional and post-transcriptional regulation of viral gene expression: 
a subset of the E4 proteins are essential for viral growth. The products of the late genes (e.g., 
LI -5) are predominantly components of the virion as well as proteins involved in the 
assembly of virions. The VA genes produce VA RNAs which block the host cell from 
shutting down viral protein synthesis. 

Adenoviruses or Ad vectors have been exploited for the delivery of foreign genes to 
cells for a number of reasons including the fact that Ad vectors have been shown to be highly 
effective for the transfer of genes into a wide variety of tissues in vivo and the fact that Ad 
infects both dividing and non-dividing cells: a number of tissues which are targets for gene 
therapy comprise largely non-dividing cells. 

The current generation of Ad vectors suffer from a number of limitations which 
preclude their widespread clinical use including: 1) immune detection and elimination of cells 
infected with Ad vectors. 2) a limited carrying capacity (about 8.5 kb) for the insertion of 
foreign genes and regulatory elements, and 3) low-level expression of Ad genes in cells 
infected with recombinant Ad vectors (generally, the e.xpression of Ad proteins is toxic to 
cells). 
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The latter problem was ihoucht to be solved by using vectors containing deletions in 
the El region ot the Ad genome (El gene products are required for viral gene expression and 
replication). However, even with such vectors, low-level expression of Ad genes is observed. 
It is now thought that most mammalian cells contain El -like factors which can substitute for 
the missing Ad El proteins and permit expression of Ad genes remaining on the El deleted 
vectors. 

What is needed is an approach that overcomes the problem of low level expression of 
Ad genes. Such an approach needs to ensure that adenovirus vectors are safe and non- 
immunogenic. 

SUMMARY OF THE INVENTION 

The present invention contemplates two approaches to improving adenovirus vectors. 
The tlrsi approach generally contemplates a recombinant plasmid. together with a helper 
adenovirus, in a packaging cell Hne. The helper adenovirus is rendered safe by utilization of 
loxP sequences. In the second approach, "damaged" adenoviruses are employed. While the 
"damaged" adenovirus is capable of self-propagation in a packaging cell line, it is not capable 
of expressing certain genes (e.g.. the DNA polymerase gene and/or the adenovirus preterminal 
protein gene). 

In one embodiment of the first approach, the present invention contemplates a 
recombinant plasmid, comprising in operable combination: a) a plasmid backbone, 
comprising an origin of replication, an antibiotic resistance gene and a eukaryotic promoter 
element: b) the left and right inverted terminal repeats fITRs) of adenovirus, said ITRs each 
having a 5* and a 3' end and arranged in a tail to tail orientation on said plasmid backbone: 
c) the adenovirus packaging sequence, said packaging sequence having a 5' and a 3' end and 
linked to one of said ITRs: and d) a first gene of interest operably linked to said promoter 
element. 

While it is not intended that the present invention be limited by the precise size of the 
plasmid. it is generally desirable that the recombinant plasmid have a total size of between 27 
and 40 kilobase pairs. It is preferred that the total size of the DNA packaged into an HAM 
derived from these recombinant plasmids is about the length of the wild- type adenovirus 
genome (-36 kb). It is well known in the art that DNA representing about 105% of the wild- 
type length may be packaged into a viral particle: thus the EAM derived from recombinant 
plasmid may contain DNA whose length exceeds by -105% the size of the wild-type genome 
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The size of the recomFmant plasmid may be adjusted using reporter genes and nenes of 
interest having various sizes (including the use of different sizes of introns within these iienes) 
as well as through the use of irrelevant or non-coding DNA fragment which act as "stuffer" 
fragments {l'.<;.. portions of bacteriophage genomes). 

In one embodiment of the recombinant plasmid. said 5' end of said packaging 
sequence is linked to said 3' end of said left ITR. in this embodiment, said first gene of 
interest is linked to said 3" end of said packaging sequence. It is not intended that the present 
invention be limited by the nature of the gene of interest: a variety of genes (includinc both 
cDNA and genomic forms) are contemplated: any gene having therapeutic value may be 
inserted into the recombinant plasmids of the present invention. For example, the transfer of 
the adenosine deaminase (ADA) gene is useful for the treatment of ADA- patients: the 
transfer of the CFTR gene is useful for the treatment of cystitic fibrosis. A wide variety of 
diseases are known to be due to a defect in a single gene. The plasmids. vectors and EAMs 
of the present invention are useful for the transfer of a non-mutated form of a gene which is 
mutated in a patient thereby resulting in disease. The present invention is illustrated using 
recombinant plasmids capable of generating encapsidated adenovirus minicliromosomes 
(EAMs) containing the dystrophin cDNA gene (the cDNA form of this gene is preferred due 
to the large size of this gene): the dystrophin gene is non-functional in muscular dystrophy 
(MD) patients. However, the present invention is not limited toward the use of the dystrophin 
gene for treatment of MD: the use of the utrophin (also called the dystrophin related protein) 
gene is also contemplated for gene therapy for the treatment of MD fTinsley et al. (1993) 
Curr. Opin. Genet. Dev. 3:484 and (1992) Nature 360:591]: the utrophin gene protein has 
been reported to be capable of functionally substituting for the dystrophin gene [Tinsley and 
Davies (1993) Neuromusc. Disord. 3:539]. As the utrophin gene product is expressed in the 
musde of muscular dystrophy patients, no immune response would be directed against the 
utrophin gene product expressed in cells of a host (including a human) containing the 
recombinant plasmids. Ad vectors or EAMs of the present invention. While the present 
invention is illustrated using plasmids containing the dystrophin gene, the plasmids. Ad 
vectors and EAMs of the present invention have broad application for the transfer of any gene 
whose gene product is missing or altered in activity in cells. 

Embodiments are contemplated wherein the recombinant plasmid further comprises a 
second gene of interest. In one embodiment, said second gene of interest is linked to said 3' 
end of said right ITR. In one embodiment, said second gene of interest is a reporter gene. A 
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variety ot reporter genes are contemplated, including but not limited to E. coli P-galactosidase 
gene, the human placental alkaline phosphatase gene, the green tluoresceni protein gene and 
the chloramphenicol aceiyltransferase gene. 

As mentioned above, the first approach also involves the use of a helper adenovirus in 
combination with the recombinant plasmid. In one embodiment, the present invention 
contemplates a helper adenovirus comprising i) first and a second loxP sequences, and ii) the 
adenovirus packaging sequence, said packaging sequence having a 5* and a 3* end. It is 
preferred that said first loxP sequence is linked to the 5" end of said packaging sequence and 
said second loxP sequence is linked to said 3* end of said packaging sequence. In one 
embodiment, the helper virus comprises at least one adenovirus gene coding region. 

The present invention contemplates a mammalian cell line containing the above- 
described recombinant plasmid and the above-described helper virus. It is preferred that said 
cell line is a 293-derived cell line. Specifically, in one embodiment, the present invention 
contemplates a mammalian cell line, comprising: a) a recombinant plasmid. comprising, in 
operable combination: i) a plasmid backbone, comprising an origin of replication, an 
antibiotic resistance gene and a eukaryotic promoter element, ii) the left and right inverted 
terminal repeats (ITRs) of adenovirus, said ITRs each having a 5' and a 3' end and arranged 
in a tail to tail orientation on said plasmid backbone, iii) the adenovirus packaging sequence, 
said packaging sequence having a 5* and a 3' end and linked to one of said ITRs. and iv) a 
first gene of interest operably linked to said promoter element: and b) a helper adenovirus 
comprising i) first and a second loxP sequences, and ii) the adenovirus packaging sequence, 
said packaging sequence having a 5" and a 3' end. As noted previously, said helper can 
further comprise at least one adenovirus gene coding region. 

Overall, the first approach allows for a method of producing an adenovirus 
minichromosome. In one embodiment, this method comprises: A) providing a mammalian 
cell line containing: a) a recombinant plasmid. comprising, in operable combination, i) a 
plasmid backbone, comprising an origin of replication, an antibiotic resistance gene and a 
eukaryotic promoter element, ii) the left and right inverted terminal repeats (ITRs) of 
adenovirus, said ITRs each having a 5' and a 3' end and arranged in a tail to tail orientation 
on said plasmid backbone, iii) the adenovirus packaging sequence, said packaging sequence 
having a 5* and a 3* end and linked to one of the ITRs. and iv) a first gene of interest 
operably linked to said promoter element; and b) a helper adenovirus comprising i) first and a 
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second loxP sequenc>r1..) ai least one adenovirus gene coding reuion. and iii) the adenovirus 
packagini: sequence, said packaging sequence having a 5" and a 3" end; and 
B) growing said cell line under conditions such that said adenovirus gene coding region is 
expressed and said recombinant plasmid directs the production of at least one adenoviral 
5 minichromosome. It is desired that said adenovirus minicliromosome is encapsidated. 

in one embodiment, the present invention contemplates recovering said encapsidated 
adenovirus minichromosome and. in turn, purifying said recovered encapsidated adenovirus 
mmichromosome. Thereafter, said purified encapsidated adenovirus minichromosome can be 
administered to a host (e.g., a mammal). Human therapy is thereby contemplated. 

10 It is not intended that the present invention be limited by the nature of the 

administration of said minichromosomes. All types of administration are contemplated, 
including direct injection (intra-muscular. intravenous, subcutaneous, etc.). inhalation, etc. 

As noted above, the present invention contemplates a second approaches to improving 
adenovirus vectors, in the second approach, "damaged" adenoviruses are employed. In one" 

i 5 embodiment, the present invention contemplates a recombinant adenovirus comprising the 
adenovirus E2b region having a deletion, said adenovirus capable of self-propagation in a 
packaging cell line and said E2b region comprising the DNA polymerase gene and the 
adenovirus preterminal protein gene. In this embodiment, said deletion can be within the 
adenovirus DNA polymerase gene. Alternatively, said deletion is within the adenovirus 
20 preterminal protein gene. Finally, the present invention also contemplates embodiments 
wherein said deletion is within the adenovirus DNA polymerase and preterminal protein 
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The present invention further provides cell lines capable of supporting the propagation 
of Ad virus containing deletions within the E2b region. In one embodiment the invention 
provides a mammalian cell line stably and constitutively expressing the adenovirus El gene 
products and the adenovirus DNA polymerase. In one embodiment, these cell lines comprise 
a recombinant adenovirus comprising a deletion within the E2b region, this E2b-deleted 
recombinant adenovirus being capable of self-propagation in the cell line. The present 
invention is not limited by the nature of the deletion within the E2b region. In one 
embodiment, the deletion is within the adenoviral DNA polymerase gene. 

The present invention provides cells lines stably expressing El proteins and the 
adenoviral DNA polymerase, wherein the genome of the cell line contains a nucleotide 
sequence encoding adenovirus DNA polymerase operably linked to a heterologous promoter. 
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In a particularly preferred embodiment, the cell line is selected Trom the iiroup consisting o( 
the B-6. B-9. C-1. C-4. C-7. C-13- and C-14 cell lines. 

The present invention further provides cell lines which further constitutively express 
the adenovirus preterminal protein (pTP) gene product (in addition to El proteins and DNA 
polvmerase). In one embodiment, these pTP-expressing cell lines comprise a recombinant 
adenovirus comprising a deletion within the E2b region, the recombinant adenovirus being 
capable of self-propagation in the pTP-expressing cell line. In a preferred embodiment, the 
deletion within the E2b region comprises a deletion within the adenoviral preterminal protein 
gene. In another preferred embodiment, the deletion within the E2b region comprises a 
deletion within the adenoviral (Ad) DNA polymerase and preterminal protein genes. 

In a preferred embodiment, the cell lines coexpressing pTP and Ad DNA polymerase, 
contain within their genome, a nucleotide sequence encoding adenovirus preterminal protein 
operably linked to a heterologous promoter. In the invention is not limited by the nature of 
the heterologous promoter chosen. The art knows w^ell how to select a suitable heterologous 
promoter to achieve expression in the desired host cell {e.^.. 293 cells or deri\ative thereof). 
In a particularly preferred embodiment, the pTP- and Ad polymerase-expressing cell line is 
selected from the group consisting of the C-K C-4. C-7. C-13. and C-14 cell lines. 

The present invention provides a method of producing infectious recombinant 
adenovirus particles containing an adenoviral genome containing a deletion within the E2b 
region, comprising: a) providing; i) a mammalian cell line stably and constitutively 
expressing the adenovirus El gene products and the adenovirus DNA polymerase: ii) a 
recombinant adenovirus comprising a deletion within the E2b region, the recombinant 
adenovirus being capable of self-propagation in said cell line: b) introducing the recombinant 
adenovirus into the cell line under conditions such that the recombinant adenovirus is 
propagated to form infectious recombinant adenovirus particles: and c) recovering the 
infectious recombinant adenovirus particles. In a preferred embodiment, the method further 
comprises d) purifying the recovered infectious recombinant adenovirus particles. In yet 
another preferred embodiment, the method further comprises e) administering the purified 
recombinant adenovirus particles to a host which is preferably a mammal and most preferably 
a human. 

In another preferred embodiment the mammalian cell line employed in the above 
method further constitutively expresses the adenovirus preterminal protein. 
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The present Wention funher provides a recombinani^Ssmid capable of replicating in 
a bacterial host comprising adenoviral E2b sequences, the E2b sequences containing a deletion 
within the polymerase gene, the deletion resulting in reduced polymerase activity. The 
present invention is not limited by the specific deletion employed to reduce polymerase 
activity. In a preferred embodiment, the deletion comprises a deletion of nucleotides 8772 to 
9385 in SEQ ID NO:4. In one preferred embodiment, the recombinant plasmid has the 
designation pApol. In another preferred embodiment, the recombinant plasmid has the 
designation pBHGl lApol. 

The present invention also provides a recombinant plasmid capable of replicating in a 
bacterial host comprising adenoviral E2b sequences, the E2b sequences containing a deletion 
within the preterminal protein gene, the deletion resulting in the inability to e.xpress functional 
preterminal protein without disruption of the VA RNA genes. The present invention is not 
limited by the specific deletion employed to render the pTP inactive: any deletion within the 
pTP coding region which does not disrupt the ability to express the Ad VA RNA tjenes may 
be employed. In a preferred embodiment, the deletion comprises a deletion of nucleotides 
10.705 to 1 1.134 in SEQ ID NO:4. In one preferred embodiment, the recombinant plasmid 
has the designation pApTP. In another preferred embodiment, the recombinant plasmid has 
the designation pBHGI lApTP. 

In a preferred embodiment, the recombinant plasmid containing a deletion with the 
pTP region further comprises a deletion within the polymerase gene, this deletion resulting in 
reduced (preferably absent) polymerase activity. The present invention is not limited by the 
specific deletion employed to inactivate the polymerase and pTP genes. In a preferred 
embodiment, the deletion comprises a deletion of nucleotides 8.773 to 9586 and 1 1.067 to 
12.513 in SEQ ID NO:4. in one preferred embodiment, the recombinant plasmid has the 
designation pAXBApolApTPVARNA+tI3. In another preferred embodiment, the recombinant 
plasmid has the designation pBHGllApolApTPVARNA+tl3. 

BRIEF DESCRIPTION OF THE FIGURES 

Figure lA is a schematic representation of the Ad polymerase expression plasmid 
pRSV-pol indicating that the Ad2 DNA polymerase sequences are under the transcriptional 
control of the RSV-LTR/promoter element and are Hanked on the 3" end by the SV-40 small 
t intron and SV-40 polyadenylation addition site. 
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Figure 1 B is a scTiemalic representation of the expression^ 
indicating that the Ad2 preterminal protein sequences are under the transcriptional control of 
the RSV-LTR/promoter element and are tlanked on the 3" end by the SV-40 small-t miron 
and SV-40 polyadenylation signals. 
5 Figure 2 is an ethidium bromide-stained gel depicting the presence of Ad pol DNA 

sequences in genomic DNA from LP-293 cells and several hygromycin-resistant cell lines. 
The -750 bp PCR products are indicated by the arrow. 

Figure 3 is an autoradiograph depicting the results of a viral replication- 
complementation assay analyzing the functional activity of the Ad polymerase protein 
10 expressed by LP-293 cells and several hygromycin resistant cell lines. The 8.010 bp HindlU 
fragments analyzed by densitometry are indicated by an arrow. 

Figure 4 is an autoradiograph indicating that cell lines B-6 and C-7 contained a 
smaller and a larger species of Ad polymerase mRNA while LP-293 derived RNA had no 
detectable hybridization signal. The location of the two species of Ad polymerase mRNA are 
15 indicated relative to the 28S and IBS ribosomal RNAs. and the aberrant transcript expressed 
by the B-9 cell line is indicated by an arrow. 

Figure 5 is an autoradiograph showing which of the cell lines that received the 
preterminal protein expression plasmid indicated the presence of pRSV-pTP sequences (arrow 
labelled "pTP'*) and El sequences (arrow labelled "El"). 
20 Figure 6 is an autoradiograph indicating in which of the cell lines transcription of 

preterminal protein is occurring (arrow labelled 3kb"). 

Figure 7 is an autoradiograph indicating that the expression of the Ad-polymerase 
could overcome the replication defect of H5sub\00 at non-permissive temperatures. 

Figure 8 graphically depicts plaque titre for LP-293, B-6 and C-7 cell lines infected 
25 with wtAd5. H5ts36. or H5.vw/?100, and the results demonstrate that the C-7 cell line can be 
used as a packaging cell line to allow the high level growth of El. preterminal, and 
polymerase deleted Ad vectors. 

Figure 9 is an autoradiograph showing that the recombinant pol" virus is viable on 
pol-expressing 293 cells but not on 293 cells which demonstrates that recombinant Ad viruses 
30 containing the 612 bp deletion found within pApol lack the ability to express Ad polymerase. 

Figure 10 is a schematic representation of the structure of pAd5pdys wherein the two 
inverted adenovirus origins of replication are represented by a left and right invened terminal 
repeat (LITR/RJTR). PI and P2 represent location of probes used for 'Southern blot analysis. 
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Figure 1 1 graphically illustrates the total number of transducing adenovirus particles 
produced (output) per serial passage on 293 cells, total input virus of either the helper (hpAP) 
or AdSpdys. and the total number of cells used in each infection. 

Figures I2A-B show Southern blot analyses of viral DNA from lysates 3. 6. 9 and 12. 
5 digested with the restriction enzymes Bs.sHU. .\rul and EcoRV . For the analyses, fragments 
from the C terminus of mu.s musculiis dystrophin cDNA (A) or the N termmus of E colt (J- 
galactosidase (B) were labeled with dCTP^- and used as probes. 

Figures 13A-B show the physical separation of AdSpdys from hpAP virions at the 
third (.A) and final (B) stages of CsCl purification. 
10 Figure 14 graphically depicts the level of contamination of Ad5pdys EAMs by hpAP 

virions obtained from the final stage of CsCl purification as measured by p-galactosidase and 
alkaline phosphatase expression. The ratio of the two types of virions - AdSpdys EAMs 
(LacZ) or hpAP ( AP) in each fraction is indicated in the lower graph. 

Figures I5A-B are western blot (immunoblot) analyses of protein e.xtracts from mdx 
15 myoblasts and myotubes demonstrating the expression of P-galactosidase (A) and dystrophin 
(B) in cells infected with AdSpdys EAMs. 

Figures I6A-C depict immunofluorescence of dystrophin expression in wild type 
MM 14 myotubes (A), uninfected nulx (B) and infected mdx myotubes (C). 

Figure 17 is a schematic representation of the MCK/lacZ constructs tested to determine 
20 what portion of the ^ 3.3 kb DNA fragment containing the enhancer/promoter of the MCK 
gene is capable of directing high levels of expression of linked genes in muscle cells. 

Figure 18 is a schematic representation of a GFP/p-gal reporter construct suitable for 
assaying the expression of Crc recombinase in mammalian cells. 

Figure 19 is a schematic representation of the recombination event between the loxP 
25 : shuttle \ ector and the AdScJnOO 1 genome. 

DEFINITIONS 

To facilitate understanding of the invention, a number of terms are defined below. 

The term "gene" refers to a DNA sequence that comprises control and coding 
30 sequences necessary for the production of a polypeptide or precursor thereof The polypeptide 
can be encoded by a full length coding sequence or by any portion of the coding sequence so 
long as the desired enzymatic activity is retained. The term "gene" encompasses both cDNA 
and genomic forms of a uiven uene. 
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The term "wild^^e" refers lo a eene or gene product \^Wri has the characteristics oi 
that eene or gene product when isolated from a naturally occurring source. A wiid-iype gene 
IS that which is most frequently observed in a population and is thus arbitrarily designated the 
"normal" or "wild-type" form of the gene. In contrast, the term "modified" or "mutant" refers 
5 to a gene or gene product which displays modifications in sequence and or functional 

properties {i.e. altered characteristics) when compared to the wild-type gene or gene product, 
li is noted that naturally-occurring mutants can be isolated; these are identified by the fact that 
they have altered characteristics when compared to the wild-type gene or gene product. 

The term "oligonucleotide" as used herein is defined as a molecule comprised of two 
10 or more deoxyribonucleotides or ribonucleotides, usually more than three (3). and typically 

more than ten (10) and up to one hundred (100) or more (although preferably between twenty 
and thirty). The exact size will depend on many factors, which in turn depends on the 
ultimate function or use of the oligonucleotide. The oligonucleotide may be generated in any 
manner, including chemical synthesis. DNA replication, reverse transcription, or a 
15 combination thereof 

As used herein, the term "regulatory element" refers to a genetic element which 
controls some aspect of the expression of nucleic acid sequences. For example, a promoter is 
a regulatory element which facilitates the initiation of transcription of an operably linked 
coding region. Other regulatory elements are splicing signals, polyadenylation signals. 
20 termination signals, eic. (defined infra). 

Transcriptional control signals in eucaryotes comprise "promoter" and "enhancer" 
elements. Promoters and enhancers consist of short arrays of DNA sequences that interact 
specifically with cellular proteins involved in transcription [Maniatis. T. ei uL, Science 
236:1237 ( 1987)]. Promoter and enhancer elements have been isolated from a variety of 
25 eukaryotic sources including genes in yeast, insect and mammalian cells and viruses 

(analogous control elements, i.e.. promoters, are also found in procaryotes). The selection of 
a particular promoter and enhancer depends on what cell type is to be used to express the 
protein of interest. Some eukaryotic promoters and enhancers have a broad host range while 
others are functional in a limited subset of cell types [for review see Voss. S.D. ci ai. Trends 
30 Biochem. Sci.. 1 1:287 (1986) and Maniatis. T. et al., supra (1987)]. 

The term "recombinant DNA vector" as used herein refers to DNA sequences 
containing a desired coding sequence and appropriate DNA sequences necessary for the 
expression of the operably linked coding sequence in a particular host organism (e.g. 



- 10 - 



" - - WO W17783- |K" PCT/US97/r9541 

mammal). DNA scqu^R necessary tor expression in procaryMf include a promoter, 
opiionaliy an operator sequence, a ribosome binding site and possibly other sequences. 
Eukaryotic cells are known to utilize promoters, poiyadenlyation signals and enhancers. 

The terms "in operable combination", "in operable order" and "operably linked" as 
used herein refer to the linkage of nucleic acid sequences m such a manner that a nucleic acid 
-molecule capable of directing the transcription of a given gene and/or the synthesis of a 
desired protein molecule is produced. The term also refers to the linkage of amino acid 
sequences in such a manner so that a functional protein is produced. 

The term "genetic cassette" as used herein refers to a fragment or segment of DNA 
10 containing a particular grouping of genetic elements. The cassette can be removed and 

• inserted into a vector or plasmid as a single unit. A plasmid backbone refers to a piece of 
DNA containing at least plasmid origin of replication and a selectable marker gene (e.g., an 
antibiotic resistance gene) which allows for selection of bacterial hosts containing the plasmid; 
the plasmid backbone may also include a polylinker region to facilitate the insenion of 
i 5 genetic elements within the plasmid. When a particular plasmid is modified to contain non- 
plasmid elements {e.g.. insertion of Ad sequences and/or a eukaryotic gene of interest linked 
to a promoter element), the plasmid sequences are referred to as the plasmid backbone. 

Because mononucleotides are reacted to make oligonucleotides in a manner such that 
the 5" phosphate of one mononucleotide pentose ring is attached to the 3' oxygen of its 
:0 neighbor in one direction via a phosphodiester linkage, an end of an oligonucleotide is 
referred to as the "5" end'" if its 5" phosphate is not linked to the 3" oxygen of a 
mononucleotide pentose ring and as the "j" end" if its 3' oxygen is not linked to a 5" 
phosphate of a subsequent mononucleotide pentose ring. As used herein, a nucleic acid 
sequence, even if internal to a larger oligonucleotide, also may be said to have 5" and 3" ends. 
.V \^hen two different, non-overlapping oligonucleotides anneal to different regions of 
the same linear complementary nucleic acid sequence, and the 3" end of one oligonucleotide 
points towards the 5" end of the other, the former may be called the "upstream" 
.oligonucleotide and the latter the "downstream" oligonucleotide. 

The term "primer" refers to an oligonucleotide which is capable of acting as a point of 
30 initiation of synthesis when placed under conditions in which primer extension is initiated. 

An oligonucleotide "primer" may occur naturally, as in a purified restriction digest or may be 
produced synthetically. 
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A primer is serened to be "substaniially" complementahrno a strand of specific 
sequence of the template. A primer must be sufnciently complementary to hybridize with a 
template strand for primer elongation to occur. A primer sequence need not retlect the exact 
sequence of the template. For example, a non-complementary nucleotide fragment mav be 
attached to the 5* end of the primer, with the remainder of the primer sequence being 
substantially complementary to the strand. Non-complementary bases or longer sequences can 
be interspersed into the primer, provided thai the primer sequence has sufficient 
complementarity with the sequence of the template to hybridize and thereby form a template 
primer complex for synthesis of the extension product of the primer. 

"Hybridization" methods involve the annealing of a complementary sequence to the 
target nucleic acid (the sequence to be detected). The ability of two polymers of nucleic acid 
containing complementary sequences to find each other and anneal through base pairing 
interaction is a well-recognized phenomenon. The initial observations of the "hybridization" 
process by Marmur and Lane. Proc, NaiL Acad. Sci. USA 46:453 (1960) and Doty cf uL. 
Proc. SluI. Acad. Sci. USA 46:461 (1960) have been followed by the refinement of this 
process into an essential tool of modern biology. 

The complement of a nucleic acid sequence as used herein refers to an oligonucleotide 
which, when aligned with the nucleic acid sequence such that the 5' end of one sequence is 
paired with the 3' end of the other, is in "antiparallel association." Certain bases not 
commonly found in natural nucleic acids may be included in the nucleic acids of the present 
invention and include, for example, inosine and 7-deazaguanine. Complementarity need not 
be perfect: stable duplexes may contain mismatched base pairs or unmatched bases. Those 
skilled in the an of nucleic acid technology can determine duplex stability empirically 
considering a number of variables including, for example, the length of the oligonucleotide, 
base composition and sequence of the oligonucleotide, ionic strength and incidence of 
mismatched base pairs. 

Stability of a nucleic acid duplex is measured by the melting temperature, or "T,„." 
The T,„ of a particular nucleic acid duplex under specified conditions is the temperature at 
which on average half of the base pairs have disassociated. The equation for calculating the 
T,„ of nucleic acids is well known in the art. 

The term "probe" as used herein refers to a labeled oligonucleotide which forms a 
duplex structure with a sequence in another nucleic acid, due to complementarity of at least 
one sequence in the probe with a sequence in the other nucleic acid. 
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The term "lab^W used herein refers to any atom or ni^uie which can be used to 
provide a detectable (prelerably quantifiable) signal, and which can be attached to a nucleic 
acid or protein. Labels may provide signals detectable by fluorescence. radioactiNity. 
colonmetry. jiravimetry. X-ray diffraction or absorption, magnetism, enzymatic actixity. and 
the like. 

The terms "nucleic acid substrate" and nucleic acid template" are used herein 
interchangeably and refer to a nucleic acid molecule which may comprise single- or double- 
stranded DNA or RNA. 

"Oligonucleotide primers matchmg or complementary to a gene sequence" refers to 
oligonucleotide primers capable of facilitating the template-dependent synthesis of single or 
double-stranded nucleic acids. Oligonucleotide primers matchmg or complementary to a gene 
sequence may be used in PCRs. RT-PCRs and the like. 

.A "consensus gene sequence" refers to a gene sequence which is derived by 
comparison of two or more gene sequences and which describes the nucleotides most often 
1 5 present in a given segment of the genes; the consensus sequence is the canonical sequence. 

The term "polymorphic locus" is a locus present in a population which shows variation 
between members of the population (i.e., the most common allele has a frequency of less than 
0.95). In contrast, a "monomorphic locus" is a genetic locus at little or no variations seen 
between members of the population (generally taken to be a locus at which the most common 
20 allele exceeds a frequency of 0.95 in the gene pool of the population). 

The term "microorganism" as used herein means an organism too small to be observed 
with the unaided eye and includes, but is not limited to bacteria, viruses, protozoans, fungi, 
and ciliates. 

. The term "microbial gene sequences" refers to gene sequences derived from a 

25 . microorganism. 

The term "bacteria" refers to any bacterial species including eubacterial and 
archaebacierial species. 

The term "virus" refers to obligate, ultramicroscopic. intracellular parasites incapable 
of autonomous replication {i.e.. replication requires the use of the host cell's machinery). 
30 Adenoviruses, as noted above, are double-stranded DNA viruses. The left and right inverted 
terminal repeats (ITRs) are short elements located at the 5" and 3" termini of the linear Ad 
genome, respectively and are required for replication of the viral DNA. The left ITR is 
located between 1-130 bp in the Ad genome (also referred to as 0-0.5 mu). The right ITR is 
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locaied trom -3. 7500^^0 the end of the genome (also reterr^Rf as <')0.5-100 mu). The two 

ITRs are inverted repeals of each other. For clarity, the left ITR or 5' end is sued define the 
5* and 3' ends of the ITRs. The 5' end of the left ITR is locaied at the extreme 5' end of the 
linear adenoviral genome: picturing the left ITR (LITR) as an arrow extending from the 5" 
end of the genome, the tail of the 5' ITR is located at mu 0 and the head of the left ITR is 
located at - 0.5 mu (further the tail of the left ITR is referred to as the 5* end of the left ITR 
and the head of the left ITR is referred to as the 3' end of the left ITR). The tail of the right 
or 3* ITR is located at mu 100 and the head of the rigt ITR is located at - mu 99.5: the head 
of the right ITR is referred to as the 5' end of the right ITR and the tail of the right ITR is 
referred to as the 3" end of the right ITR (RITR). In the linear Ad genome, the ITRs face 
each other with the head of each ITR pointing inward toward the bulk of the genome. When 
arranged in a "tail to tail orientation" the tails of each ITR (which comprise the 5' end of the 
LITR and the 3' end of the RITR) are located in proximity to one another while ihe heads of 
each ITR are separated and face outward {see for example, the arrangement of the ITRs in the 
EAM shown in Figure 10 herein). The "adenovirus packaging sequence" refers lo the 4^ 
sequence which comprises five (AI-AV) packaging signals and is required for encapsidation 
of the mature linear genome: the packaging signals are locaied from -194 to 358 bp in the Ad 
genome (about 0.5-1.0 mu). 

The phrase "at least one adenovirus gene coding region" refers to a nucleotide 
sequence containing more than one adenovirus gene coding gene. A "helper adenovirus" or 
"helper \ irus" refers to an adenovirus which is replication-competent in a particular host cell 
(the host may provide Ad gene products such as EI proteins), this replication-competent virus 
is used to supply in trans functions (e.ff.. proteins) which are lacking in a second replication- 
incompetent virus: the first replication-competent virus is said to "help" the second 
replication-incompetent virus thereby permitting the propagation of the second viral genome 
in the cell containing the helper and second viruses. 

The term "containing a deletion within the E2b region" refers to a deletion of at least 
one basepair (preferably more than one bp and preferably at least 100 and most preferably 
more than 300 bp) within the E2b region of the adenovirus genome. An E2b deletion is a 
deletion thai prevents expression of at least one E2b gene product and encompasses deletions 
within exons encoding portions of E2b-specific proteins as well as deletions within promoier 
and leader sequences. 
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An "adenoviiWl.nichromosome" refers to a linear mWule of DNA containinii the 
Ad ITRs on each end which is generated from a plasmid containing the ITRs and one or more 
gene of mterest. The term "encapsidated adenovirus mmichromosome" or "EAM" refers to an 
adenovirus mmichromosome which has been packaged or encapsidated into a viral particle: 
plasmids containing the Ad ITRs and the packaging signal arc shown herein to produce 
EAMs. When used herein, "recovering" encapsidated ade^o^■irus minichromosomes refers to 
the collection of EAMs from a cell containing an EAM plasmid and a helper virus; this cell 
will direct the encapsidation of the mmichromosome to produce EAMs. The EAMs may be 
recovered from these cells by lysis of the cell (e.g., freeze-thawing) and pelleting of the cell 
debris to a cell extract as described in E.xample I (Ex. 1 describes the recovery of Ad virus 
from a cell, but the same technique is used to recover EAMs from a cell). "Purifying" such 
minichromosomes refers to the isolation of the recovered EAMs in a more concentrated form 
(relative to the cell lysate) on a density gradient as described in Example 7: purification of 
recovered EAMs permits the physical separation of the EAM from any helper virus (if 
present). 

The term "transfection" as used herein refers to the introduction of foreign DNA into 
eukaryotic cells. Transfection may be accomplished by a variety of means known to the art 
including calcium phosphate-DNA co-precipitation. DEAE-dextran-mediated transfection. 
poiybrene-mediated transfection. eiectroporation. microinjection, liposome fusion, lipofection. 
protoplast fusion, retroviral infection, and biolistics. 

The term "stable transfection" or "stably transfected" refers to the introduction and 
integration of foreign DNA into the genome of the transfected cell. The term "stable 
transfectant" refers to a cell which has stably integrated foreign DNA into the genomic DNA. 

As used herein, the term "gene of interest" refers to a gene inserted into a vector or 
plasmid whose expression is desired in a host cell. Genes of interest include genes having 
therapeutic value as well as reporter genes. A variety of such genes are contemplated, 
including genes of interest encoding a protein which provides a therapeutic function (such as 
the dystrophin gene, which is capable of correcting the defect seen in the muscle of MD 
patients), the utrophin gene, the CFTR gene (capable of correcting the defect seen in cystitic 
fibrosis patients), etc.. 

The term "reporter gene" indicates a gene sequence that encodes a reporter molecule 
(including an enzyme). A "reporter molecule" is detectable in any detection system, 
including, but not limited to enzyme (e.g:. ELISA. as well as enzyme-based histochemical 
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assavs), tluoresceni. raoSSctive. and luminescent systems. In oiS^mbodiment. the present 

invention contemplates the E. coli P-galactosidase gene (available from Pharmacia Biotech. 

Pistacata\va\ . NJ). green lluorescenl protein (GFP) (commercially available tVom Clontech. 

Paio Alio. CA). the human placental alkaline phosphatase gene, the chloramphenicol 

acetvliransierase (CAT) gene: other reporter genes are known to the art and may be 

employed. 

As used herein, the terms "nucleic acid molecule encodini:," "DNA sequence 
encoding." and "DNA encoding" refer to the order or sequence of deoxyribonucieoiides along 
a strand of deoxyribonucleic acid. The order of these deoxyribonucleotides determines the 
order of amino acids along the polypeptide (protein) chain. The DNA sequence thus codes 
for the amino acid sequence. 

DESCRIPTION OF THE INVENTION 

The present invention provides improved adenovirus vectors for the delivery of 
recombinant genes to cells in vitro and in vivo. As noted above, the present invention 
contemplates two approaches to improving adenovirus vectors. The first approach generally 
contemplates a recombinant plasmid containing the minimal region of the Ad genome 
required for replication and packaging (i.e., the left and right ITR and the packaging or 
sequence) along with one or more genes of interest; this recombinant plasmid is packaged into 
an cncapsidated adenovirus minichromosome (EAM) when grown in parallel with an El- 
deleted helper virus in a cell line expressing the El proteins (e.<^.. 293 cells). The 
recombinant adenoviral minichromosome is preferentially packaged. To prevent the 
packaging of the helper virus, a helper virus containing loxP sequences Hanking the T 
sequence is employed in conjunction with 293 cells expressing Cre recombinase; Cre-loxP 
mediated recombination removes the packaging sequence form the helper genome thereby 
preventing packaging of the helper during the production of EAMs. In the second approach, 
"damaged" or "deleted" adenoviruses containing deletions within the E2b region are 
employed. While the "damaged" adenovirus is capable of self-propagation in a packaging cell 
line expressing the appropriate E2b protein(s), the E2b-deleted recombinant adenovirus are 
incapable of replicating and expressing late viral gene products outside of the packaging cell 
line. 

In one embodiment, the self-propagating recombinant adenoviruses containing 
deletions in the E2b reeion of the adenovirus uenome. In another embodiment, "gutted'* 
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viruses are contemplaWthese viruses lack all viral coding: relWs. In addition, packagins 
cell lines co-expressing El and E2b uene products are provided which allow the production of 
infectious recombinant virus containing deletions in the El and E2b regions without the use 
of helper virus. 

The Description of the Invention is divided into the following sections: I. Self- 
Propagatmg Adenovirus Vectors: II. Packaging Ceil Lines: and III. Encapsidated .Adenoviral 
Minicliromosomes. 

I. Self-Propagating Adenovirus Vectors 

Self-propagating adenovirus (Ad) vectors have been e.xtensively utilized to deliver 
foreign genes to a great variety of cell types in vitro and in vivo. "Self-propagating viruses- 
are those which can be produced by transfection of a single piece of DNA (the recombinant 
\ iral genome) into a single packaging cell line to produce infectious virus: self-propagating 
viruses do not require the use of helper virus for propagation. 

E.xisting Ad vectors have been shown to be problematic in vivo. Tliis is due in part 
because current or first generation Ad vectors are deleted for only the early region I (El) 
genes. These vectors are crippled in their ability to replicate normally without the trans- 
complementation of El functions provided by human 293 cells, a packaging cell line (ATCC 
CRL 1573: Graham ci al. (1977) J. Gen. Virol. 36:59]. Unfortunately, with the use of high 
litres of El deleted vectors, and the fact that there are El -like factors present in many cell 
types. El deleted vectors can overcome the block to replication and e.xpress other viral gene 
products [Imperiale et al. (1984) Mol. Cell Biol. 4:867: Nevins (1981) Cell 26:213: and 
Gaynor and Berk (1983) Cell 33:683]. The e.xpression of viral proteins in the infected target 
.cells elicits a swift host immune response, that is largely T-cell mediated [Yang and Wilson 
,,(,1995) J. Immunol. 155:2564 and Yang et al. (1994) Proc. Natl. Acad. Sci. USA 91:4407]. 
The transduced cells are subsequently eliminated, along with the transferred foreign gene. In 
immuno-incompetent animals. Ad delivered genes can be expressed for periods of up to one 
:year [Yang et al. (1994). supra: Vincent ei al. (1993) Nature Genetics 5:130: and Yang et al. 
(1995) Proc. Natl. Acad. Sci. USA 92:7257]. 

Another shortcoming of first generation Ad vectors is that a single recombination 
event between the genome of an Ad vector and the integrated El sequences present in 293 
cells can generate replication competent Ad (RCA), which can readily contaminate viral 
stocks. 
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in order to I'ur^^cripple viral protein expression. and^SI) to decrease the tVequenc\ 

of ireneraiinu RCA. the present invention provides Ad vectors containing deletions in the E2b 
region. Propagation of these E2b-deleted Ad vectors requires cell lines which express the 
deleted E2b gene products. The present invention provides such packaging cell lines and for 
the first lime demonstrates thai the E2b gene products. DNA polymerase and preterminal 
protein, can be consiituiively expressed in 293 cells along with the El gene products. With 
every gene that can be constiiutively expressed in 293 cells comes the opponunitv to generate 
new \'ersions of Ad vectors deleted for the respective genes. This has immediate benefits; 
increased carrying capacity, since the combined coding sequences of the polymerase and 
preterminal proteins that can be theoretically deleted approaches 4.6 kb and a decreased 
incidence of RCA generation, since two or more independent recombination events would be 
required to generate RCA. Therefore, the novel El, Ad polymerase and preterminal protein 
expressing cell lines of the present invention enable the propagation of Ad vectors with a 
carrying capacity approaching 13 kb. without the need for a contaminating helper virus 
[Mitani ei ul. (1995) Proc. Natl. Acad. Sci. USA 92:3854]. in addition, when genes critical 
to the viral life cycle are deleted (e.g., the E2b genes), a further crippling of Ad to replicate 
and express other viral gene proteins occurs. This decreases immune recognition of virally 
infected cells, and allows for extended durations of foreign gene expression. The most 
important attribute of El, polymerase, and preterminal protein deleted vectors, however, is 
their inability to express the respective proteins, as well as a predicted lack of expression of 
most of the viral structural proteins. For example, the major late promoter (MLP) of Ad is 
responsible for transcription of the late structural proteins LI through L5 [Doertler. In 
AUcnovims DNA, The Viral Genome and Its Expression (Martinus Nijhoff Publishing Boston. 
1986)]. Though the MLP is minimally active prior to Ad genome replication, the rest of the 
late genes get transcribed and translated from the MLP only after viral genome replication has 
occurred [Thomas and Mathews (1980) Cell 22:523]. This cis-dependent activation of late 
gene transcription is a feature of DNA viruses in general, such as in the growth of polyoma 
and SV-40. The polymerase and preterminal proteins are absolutely required for Ad 
replication (unlike the E4 or protein IX proteins) and thus their deletion is extremely 
detrimental to Ad vector late gene expression. 
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11. Packal^f Cell Lines Constitutively Expressi^^2b Gene 
Products 

The present invention addresses the limitations of current or first generation Ad 
xectors by isolating novel 293 cell lines coexpressing critical viral gene functions. The 
5 present m\*ention describes the isolation and characterization of 293 cell lines capable of 
constitutively expressing the Ad polymerase protein. In addition, the present mvention 
describes the isolation of 293 cells which not only express the EI and polvmerase proteins, 
but also the Ad-preterminal protein. The isolation of cell lines coexpressing the El. Ad 
polymerase and preterminal proteins demonstrates that three genes critical to the life cycle of 
10 Ad can be constitutively coexpressed. without toxicity. 

In order to delete critical genes from self-propagating Ad vectors, the proteins encoded 
. by the targeted genes have to first be coexpressed in 293 cells along with the El proteins. 
Therefore, only those proteins which are non-ioxic when coexpressed constitutively (or toxic 
proteins inducibly-expressed) can be utilized. Coexpression in 293 cells of the El and E4 
15 genes has been demonstrated (utilizing inducible, not constitutive, promoters) [Yeh el oL 

(1996) J. Virol. 70:559; Wang et ai (1995) Gene Therapy 2:775; and Gorziglia et ui (1996) 
J. ViroL 70:4173]. The El and protein IX genes (a virion structural protein) have been 
coexpressed [Caravokyri and Leppard (1995) J. Virol. 69:66271, and coexpression of the El, 
E4. and protein IX genes has also been described [Krougliak and Graham (1995) Hum. Gene 
20 Ther. 6:1575]. 

The present invention provides for the first time, cell lines coexpressing El and E2b 
gene products. The E2b region encodes the viral replication proteins which are absolutely 
required for Ad genome replication [Doerfler. supni and Pronk ei al. (1992) Chromosoma 
. 102:S39-S45J. The present invention provides 293 cells which constitutively express the 140 
25 : _ kD Ad-polymerase. While other researchers have reported the isolation of 293 cells which 
express the Ad-preterminal protein utilizing an inducible promoter [Schaack et ai (1995) J. 
Virol. 69:4079], the present invention is the first to demonstrate the high-level, constitutive 
„: coexpression of the El. polymerase, and preterminal proteins in 293 cells, without toxicity. 
These novel cell lines permit the propagation of novel Ad vectors deleted for the El, 
30 polymerase, and preterminal proteins. 
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The preseni invention also provides encapsidated adenovirus minichromosome (EAM) 
consistinii of an infectious encapsidated linear genome containing Ad origins of replication, 
packaging signal elements, a reporter gene (e.g., a [J-galactosidase reporter gene cassette) and 
a gene of interest (e.g., a full length (14 kb) dystrophin cDNA regulated by a muscle specific 
enhancer/promoter). EAMs are generated by cotranstecting 293 cells with supercoiled 
plasmid DNA (e.g., pAd5[3dys) containing an embedded inverted origin of replication (and the 
remaining above elements) together with linear DNA from El -deleted virions expressing 
human placental alkaline phosphatase (hpAP) (a helper virus). All proteins necessary for the 
generation of EAMs are provided in tram from the hpAP virions and the two can be 
separated from each other on equilibrium CsCl gradients. These EAMs are useful for gene 
transfer to a variety of cell types both in vitro and in vivo. 

Adenovirus-mediated gene transfer to muscle is a promising technology for gene 
therapy of Duchenne muscular dystrophy (DMD). However, currently available recombinant 
adenovirus vectors have several limitations, including a limited cloning capacity of -8.5 kb. 
and the induction of a host immune response that leads to transient gene expression of 3 to 4 
weeks in immunocompetent animals. Gene therapy for DMD could benefit from the 
development of adenoviral vectors with an increased cloning capacity to accommodate a full 
length (-14 kb) dystrophin cDNA. This increased capacity should also accommodate gene 
regulatory elements to achieve expression of transduced genes in a tissue-specific marmer. 
Additional vector modifications that eliminate adenoviral genes, expression of which is 
associated with development of a host immune response, might greatly increase long term 
expression of virally delivered genes in vivo. The constructed encapsidated adenovirus 
minichromosomes of the present invention are capable of delivering up to 35 kb of non-viral 
exogenous DNA. These minichromosomes are derived from bacterial plasmids containing 
two fused inverted adenovirus origins of replication embedded in a circular genome, the 
adenovirus packaging signals, a Pgalactosidase reporter gene and a full length dystrophin 
cDNA regulated by a muscle specific enhancer/promoter. The encapsidated minichromosomes 
are propagated in vitro by rram-complementation with a replication defective (EKES deleted) 
helper virus. These minichromosomes can be propagated to high titer (>10Vml) and purified 
on CsCl gradients due to their buoyancy difference relative to helper virus. These vectors are 
able to transduce myogenic cell cultures and express dystrophin in myotubes. These results 
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demoiisirate that enca^ffiated adenovirus minichromosomes are useful for gene transfer to 
muscle and other tissues. 

The present invention funher provides methods for modifyint: the above-described 
EAM s>stem to enable the generation of high titer slocks of EAMs with minimal helper virus 
contamination. Preferably the EAM stocks contain helper virus representing less than 1%. 
.preferably less than 0.1% and most preferably less than 0.01% (including 0.0%) of the final 
viral isolate. 

The amount of helper virus present in the EAM preparations is reduced in two ways. 
The first is by selectively controlling the relative packaging efficiency of the helper virus 
versus the EAM virus. The Cre-loxP excision method is employed to remove the packaging 
signals from the helper virus thereby preventing the packaging of the helper virus used to 
provide in trans viral proteins for the encapsidation of the recombinant adenovirus 
minichromosomes. The second approach to reducing or eliminating helper virus in EAM 
stocks IS the use of improve physical methods for separating EAM from helper virus. 

15 

EXPERIMENTAL 

The following examples serve to illustrate cenain preferred embodiments and aspects 
of the present invention and are not to be construed as limiting the scope thereof 

In the e.xperimental disclosure which follows, the following abbreviations apply: M 

20 (molar); mM (millimolar): nM (micromolar): mol (moles); mmol (millimoles); Mmoi 
(micromoles): nmol (nanomoles); mu or m.u. (map unit); g (gravity): gm (crams): me 
(milligrams): ng (micrograms): pg (picograms); L (liters): ml (milliliters): ^1 (microliters); cm 
(centimeters): mm (millimeters): urn (micrometers); nm (nanometers): hr (hour); min 
-.(minute): msec (millisecond); 'C (degrees Centigrade): AMP (adenosine 5 -monophosphate); 

25 .;cDNA (copy or complimentary DNA): DTT (dithiotheritol): ddH,0 (double distilled water); 
dNTP (deoxyribonucleotide triphosphate); rNTP (ribonucleotide triphosphate); ddNTP 
(dideoxyribonucleotide triphosphate); bp (base pair); kb (kilo base pair); TLC (thin layer 
•chromatography); tRNA (transfer RNA); nt (nucleotide); VRC (vanadyl ribonucleoside 
complex): RNase (ribonuclease); DNase (deoxyribonuclease); poly A (polyriboadenylic acid); 

30 PBS (phosphate buffered saline); OD (optical density); HEPES (N-[2- 

Hydroxyethyllpiperazine-N-[2-ethanesulfonic acidl); HBS (HEPES buffered saline); SDS 
(sodium dodecyl sulfate); Tris-HCl (tris(Hydroxymethyl]aminomethane-hydrochloride); rpm 
(revolutions per minute): ligation buffer (50 mM Tris-HCI. 10 mM MgCU. 10 mM 
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dilhiothreitoL 25 i^tii/mTbovine serum albumin, and 26 uM NAD-^. and pM 7.8): EGTA 
{ethylene glycol-bist P-aminoethyl ether) N. N. N". N'-teiraacetic acid): EDTA 
(ethylenediaminetetracetic acid): ELISA (enzyme linked immunosorbani assay): LB (Luria- 
Bertani broth: 10 iz trypione. 5 g yeast extract, and 10 g NaCl per liter. pH adjusted lo 7.5 
with IN NaOH): superbrolh (12 g tryptone. 24 g yeast extract. 5 g glycerol. 3.8 g KH.POj 
and 12.5 g. IC.HPOj per liter): DMEM (Dulbecco's modified Eagle's medium): ABI (Applied 
Biosysiems Inc.. Foster City. CA); Amersham (Amersham Corporation. Arlington Heights. 
ID: ATCC (American Type Culture Collection, Rockville. MY): Beckman (Beckjnan 
Instruments Inc.. Fullerton CA); BM (Boehringer Mannheim Biochemicals. Indianapolis. IN): 
Bio-lOl (Bio-101. Vista. CA): BioRad (BioRad. Richmond, CA); Brinkmann (Brinkmami 
Instruments Inc. Wesbury. NY); BRI., Gibco BRL and Life Technologies (Bethesda Research 
Laboratories. Life Technologies Inc.. Gaithersburg. MD): CRI (Collaborative Research Inc. 
Bedford. MA); Eastman Kodak (Eastman Kodak Co.. Rochester. NY); Eppendorf (Eppendorf 
Eppendorf North America. Inc.. Madison, WI); Falcon (Becton Dickenson Labware, Lincoln 
Park. NJ): IBI (International Biotechnologies, Inc., New Haven. CT); ICN (ICN Biomedicals. 
Inc.. Costa Mesa. CA): Invitrogen (Invitrogen. San Diego, CA); New Brunswick (New 
Brunswick Scientific Co. Inc.. Edison, NJ); NEB (New England BioLabs Inc.. Beverly. MA); 
NEN (Du Pont NEN Products. Boston, MA); Pharmacia (Pharmacia LKB Gaithersburg, MD); 
Promega (Promega Corporation. Madison. WI); Stratagene (Straiagene Cloning Systems, La 
Jolla. CA): UVP (UVP, Inc.. San Gabreil. CA); USB (United States Biochemical Corp.. 
Cleveland. OH); and Whatman (Whatman Lab. Products Inc. Clifton. NJ). 

Unless otherwise indicated, all restriction enzymes and DNA modifying enzymes were 
obtained from New England Biolabs (NEB) and used according to the manufacturers 
directions. 



EXAMPLE 1 



Generation Of Packaging Cell Lines That Coexpress 



The Adenovirus El And DNA Polymerase Proteins 



In this example, packaging cell lines coexpressing Ad El and polymerase proteins 
were described. These cell lines were shown to support the replication and growih of H5ts36, 
an Ad with a temperature-sensitive mutation of the Ad polymerase protein. These 
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polymerase-expressin>^ckacinu cell lines can be used to prepare Ad vectors deleted for the 
El and polymerase functions. 



a) Tissue Culture And Virus Growth 

5 LP-293 cells (Microbix Biosystems. Toronto) were urown and serially passaged as 

suggested by the supplier. 

Plaque assays were performed in 60 mm dishes containing cell monolayers at ~ 90% 
coniluencN . The appropriate virus dilution in a 2% DMEM solution was dripped onto the 
cells, and the plates incubated at the appropriate temperature for one hour. The virus 

!0 containing media was aspirated, the monolayer was overlaid with 10 mis of a pre-warmed 
EMEM agar overlay solution (0.8% Noble agar. 4% fetal calf serum, and antibiotics) and 
allowed to solidify. After the appropriate incubation time (usually 7 days for incubations at 
38.5°C and 10-12 days for incubations at 32°C). live mis of the agar-containing solution 
contaming i.3% neutral red was overlaid onto the infected dishes and plaques were counted 

15 the next day. An aliquot of the virus H5ts36 [Freimuth and Ginsberg (1986) Proc. Natl. 

Acad. Sci. USA 83:7816] was utilized to produce high titre stocks after infection of 293 cells 
at 32°C. H5ts36 is an Ad5-derived virus defective for viral replication at the nonpermissive 
temperature [Miller and Williams (1987) J. Virol. 61:3630]. 

The infected cells were harvested after the onset of extensive cytopathic effect, 

20 pelleted by centrifugation and resuspended in 10 miM Tris-CI. pH 8.0. The lysate was freeze- 
ihaued three times and centrifuged to remove the cell debris. The cleared lysate was applied 
to CsCl: step gradients (heavy CsCl at density of 1.45 g.'ml. the light CsCI at density of 1.20 
g./ml). ultracentrifuged. and purified using standard techniques [Graham and Prevec (1991) in 
Methods in Molecular Biology. Vol 7. Gene Transfer and Expression Protocols. Murray 

25 (ed.). Humana Press, Clifton. NJ. pp. 109-128]. The concentration of plaque forming units 
(pfu) of this stock was determined at 32°C as described above. Virion DNA was extracted 
from the high titre stock by pronase digestion, phenol-chloroform extraction: and ethanol 
precipitation. The leakiness of this stock was found to be <l in 2000 pfu at the non- 
permissive temperature, consistent with previous reports [Miller and Williams (1987). supra]. 

30 

b) Plasmids 

The expression plasmid pRSV-pol [Zhao and Padmanabhan (1988) Cell 55:1005] 
contains sequences encoding the Ad2 polymerase mRNA (including the start codon from the 
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exon ai map unit 39) under the transcriptional control of the Rous Sarcoma Virus 
LTRy'promoier element: the Ad2 DNA polymerase sequences are tlanked on the 3' end bv ihe 
SV-40 small t intron and SV-40 polyadenylaiion addition site (see Fig. lA). Fiuure I A 
provides a schematic representation of the Ad polymerase expression plasmid pRSV-pol. 
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5 pRSV-pol includes the initiator methionine and amino-terminai peptides encoded by the exon 
at m.u. 39 of the Ad genome. The location of the PCR primers p602a and p2158c. the two 
Seal I kb probes utilized for Northern analyses, the polymerase terminator codon, and the 
polyadenylation site of the IVa2 gene (at 11.2 m.u. of the Ad genome) are indicated. 

The expression plasmid pRSV-pTP [Zhao and Padmanabhan ( I 988). supra] contains 

10 sequences encoding the Ad2 preterminal protein (including the amino terminal peptides 

encoded by the exon at map unit 39 of the Ad genome) under the transcriptional control of 
the Rous Sarcoma Virus LTR/promoter element; the pTP sequences are flanked on the 3' end 
by the SV-40 small-t intron and SV-40 polyadenylation signals (see Figure 1 B for a schematic 
of pRSV-PTP). Figure IB also shows the location of the EcoRV subfragment utilized as a 

15 probe in the genomic DNA and cellular RNA evaluations, as well as the initiator methionine 
codon present from map unit 39 in the Ad5 genome. The locations of the H5/>7l90 and 
H5siih\00 insertions are shown relative to the preterminal protein open-reading frame. The 
following abbreviations are used in Figure 1: ORF. open reading frame: small-t. small tumor 
antigen, and m.u.. map unit. 

20 pCEP4 is a plasmid containing a hygromycin expression cassette (Invitrogen). 

pFG140 (Microbix Biosystems Inc. Toronto. Ontario) is a plasmid containing 
sequences derived from Ad5£//309 which contain a deletion/substitution in the E3 region 
[Jones and Shek (1979) Cell 17:683]. pFG140 is infectious in single transfeciions of 293 
cells and is used as a control for transfection efficiency. 



c) 



Transfection Of 293 Cells 



30 



LP-293 cells were cotransfected with DamHl linearized pRSV-pol and BamHl 
linearized pCEP4 at a molar ratio of 10:1 using a standard CaPOj precipitation method 
[Sambrook ct al. (1989) Molecular Cloning: A Laboratory Manual. Cold Spring Harbor 
Press. Plainview. NY. pp. 16.33-16.36]. In addition. 293 cells were cotransfected with or with 
DamHl linearized pRSV-poL BamHl linearized pRSV-pTP and pCEP using a molar ratio of 
10:1 (non-selectable:selectable plasmids). 
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Forty-eighl hml^ alter iranslection. the cells were passaged into media containing 
hygromycin at 100 Mg/mL. individual hygromycin resistant colonies were isolated and 
expanded. 



d) Isolation Of Ad Polymerase Expressing 293 Cell Lines 

Twenty hygromycin resistant ceil lines were expanded and screened for the ability to 
express the Ad polymerase protein. Initially, the individual cell lines were assayed for the 
ability to support growth ol" the viral polymerase mutant H5ts36 using the plaque assay 
described in section a) above. It was speculated that constitutive expression of the wild type 
Ad polymerase protein in a clonal population of 293 cells should allow the growih of H5ts36 
at 38.5°C. However, it was unclear if constitutive expression of the Ad polymerase would be 
toxic when coexpressed with the El proteins present in 293 cells. Similar toxicity problems 
have been observed with the Ad ssDBP. and the pTP [Klessig ci al. (1984) Mol. Cell. Biol. 
4; 1354 and Schaack ei al. (1995) J. Virol. 69:4079). 

Of the twenty hygromycin resistant cell lines isolated, seven were able to support 
plaque formation with H5ts36 at the non-permissive temperature, unlike the parental LP-293 
cells: these cell lines were named B-6. B-9. C-1. C-4. C-7. C-13 and C-14 (see Table I ) (the 
B-6 and B-9 cell lines received only the pRSV-pol and CEP4 plasmids: C-I. C-4. C-7. C-13 
and C-14 cell lines received the pRSV-pol. pRSV-pTP and CEP4 plasmids). For the results 
shown in Table i. dishes (60 mm) of near confluent cells of each cell line were infected with 
the same dilution of H5ts36 at the temperature indicated, overlaid with agar media, and 
stained for plaques as outlined in section a. Passage number refers to the number of serial 
passages after initial transfection with the plasmid pRSV-pol. 

The cell line B-6 produced plaques one day earlier than the other Ad polvmerase- 
expressing cell lines, which may reflect increased polymerase expression (see below). Cell 
line B-9 demonstrated an increased doubling time whereas each of the other cell lines 
displayed no growth disadvantages relative to the parental LP-293 cells. As shown in Table 
1. even after multiple passages (in some instances up to four months of serial passaging) the 
cells were still capable of H5ts36 plaque formation at the non-permissive temperature, 
indicating that the RSV-LTR/promoter remained active for extended periods of time. 
However, the cell lines B-9 and C-13 displayed a decreased ability to plaque the virus at 32°C 
as well as at 38.5°C. suggesting that a global viral complementation defect had occurred in 
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these cell lines after extended passaging. The remaining cell lines screened at later passaces 
demonstrated no such defect, even after 20 passages (e.g.. cell line B-6. Table \ ). 

TABLE 1 



Plnquing Ability Of H5is36 Ai The Non-Permissive Temperaiure Utilizing 293-Ad Polymerase Exprcssms 

Cell Lines 



Cell Line 


Passage Number 


iNumbcr Of Plaques At: 


J2.U C 


38.5°C 


293 






0 


B-6 


Q 


>500 


>500 


— u 


>500 


>500 


8-9 




>500 


>500 


1 4 


90 


18 


C-l 


5 




500 


1 3 


>500 


500 


C-4 


5 


>500 


>500 


1 A 


500 


500 


C-7 


5 


>500 


>500 


14 


>500 


^500 


C-l 3 


5 


>500 


-500 


27 . 


120 


4 


C-14 


5 


>500 


500 


27 


>500 


370 



c) Genomic Analysis Of Ad Polymerase-Expressing Cell Lines 

Genomic DNA from LP-293 cells and each of the seven cell lines able to complement 
H5ts36 at 38.5*^0 were analyzed by PCR for the presence of pRSV-pol derived sequences. 
Genomic DNA from LP-293 cells and the hygromycin resistant cell lines were harvested 
using standard protocols (Sambrook e( ai. supra) and 200 ng of DNA from each cell line was 
analyzed by PCR in a solution containing 2 ng/mL of primers p602a and p2158c. 10 mM 
TrisHCl. pH 8.3. 50 mM KCl, 1.5 mM MgCU, and 0.001% gelatin. The forward primer. 
p602a [5'-TTCATTTTATGTTTCAGGTTC AGGG-3' (SEQ ID NO:2)] is located in the SV- 
40 polyadenylation sequence. The reverse primer p2158c [5'- 

TTACCGCCACACTCGCAGGG-3' (SEQ ID N0:3)] is Ad-sequence specific with the 5' 
nucleotide located at position 3394 of the Ad 5 genome [numbering according to Doertler 
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( 1986) AJcnovinis The Viral Genome ami lis Expression. Niihotl". Boston. MA. pp |. 

95|. 

PCR was performed with a Perkin Elmer 9600 Thermocycler utilizing the following 
cycling parameters: initial denaturation at 94°C for 3 mm. 3 cycles of denaturation ai 94°C 
5 for 30 sec. annealing at 50X for 30 sec. and extension at 72°C for 60 sec. followed by 

another 27 cycles with an increased annealing temperature at 56°C. with a final extension at 
_ 72°C for 10 minutes. PCR products were separated on a 1.0 % agarose gel and visualized 
with ethidium bromide staining (Figure 2). A 1 kb ladder (Gibco-BRl) was used as a size 

and the plasmid pRSV-pol was used as a positive control. The ~750 bp PCR products 
10 „ are indicated by an arrow in Figure 2. 

As shown in Figure 2. all cell lines capable of H5ts36 plaque formation at 38.5°C 
- contained the Ad pol DNA sequences, whereas the LP-293 cells did not yield any 

amplification product with these primers. This result demonstrates that each of the selected 
cell lines stably co-integrated not only the hygromycin resistance plasmid pCEP4. but also 
15 pRSV-pol. 

0 Complementation Of The Replication Defect Of H5ts36 By 
Ad Polymerase Expressing Cell Lines 

The C to T transition at position 7623 of the H5ls36 genome alters the DNA binding 
affinity of the Ad polymerase protein, rendering it defective for viral replication at non- 
permissive temperatures [Chen ei al. (1994) Virology 205:364: Miller and Williams (1987) J. 
Virol. 61:3630: and Wilkie e, al. (1973) Virology 51:499]. To analyze the functional activity 
of the Ad polymerase protein expressed by each of the packaging cell lines, a viral 
replication-complementation assay was performed. LP-293 cells or the hygromycin resistam 
cell lines were seeded onto 60 mM dishes at densities of 2.5-3.0 x lO^'per dish, infected with 
H5ts36 at a multiplicity of infection (MOl) of 10. and incubated for 24 hours at 38.5°C. or 48 
hours at 32°C. Total DNA was harvested from each plate, then 2 ^g of each sample were 
digested with HindlW. separated on a 1.0% agarose gel. transferred to a nylon membrane, and 
hybridized with -^-P-labeJed H5ts36 virion DNA. Densitometnc analysis of the 8.010 bp 
HindWX fragment in each lane was performed on a phophoroimager (Molecular Dynamics) 
utilizing a gel image processing .system (IP Lab Version 1.5. Sunnyvale. CA). 



20 



25 
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The resulting auioradiograph is shown in Figure 3. In Figure 3. ihe lane marked 
"Std." (standard) contains I fag of ///ndlll-dicesied H5is36 virion DNA. The 8.010 bp 
HindUl tVagmenis analyzed by densitometry are indicated by an arrow. 

As shown in Figure 3. H5ls36 had a diminished ability to replicate in LP-293 cells at 
5 the non-permissive temperature. In contrast, all seven of the previously selected cell lines 

were able to support replication of H5ls36 virion DNA at 38.5°C to levels approaching those 
occurring m LP-293 cells at 32°C. 

A densitometric analysis of the amount of H5ts36 viral DNA replicated in each of the 
cell lines at permissive and nonpermissive temperatures is presented in Table 2 below. For 
10 this assay, the relative amounts of the 8.010 bp HindlW fragment were compared. The 

relative levels of H5ts36 virion DNA replication determined by densitometric analysis of the 
8.010 bp HindlU fragment isolated from each of the cell line DNA samples. The surface area 
of the 8.010 bp fragment in 293 cells incubated at 38.5°C was designated as 1. and includes 
some replicated H5ts36 virion DNA. The numbers in each column represent the ratio 
15 between the densities of the 8.010 bp fragment isolated in the indicated cell line and the 
density of the same band present in LP-293 cells at 38.5°C. 

As shown in Table 2, the levels of replication at the permissive temperature were all 
within four-fold of each other, regardless of which cell line was analyzed, but at the non- 
permissive temperature LP-293 cells reveal the H5ts36 replication defect. The viral bands 
20 that were present in the LP-293 DNA sample at 38. C represented input virion DNA as well as 
low level replication of H5ts36 DNA. which is generated due to the leakiness of the ts 
mutation at the high MOI utilized in this experiment. The Ad polymerase-expressing cell 
lines were all found to be capable of augmenting 
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TABLE 2 



15 



20 



Cell Line 


Ratios Of 8.010 bp HindlW Fragment Generated At: 


32.0°C 


38.5°C 


LP-293 


42.6 


LO 


B-6 


27.3 


57.2 


B-9 


69.0 


15.8 


C-i 


1 13.6 


34.1 


C-4 


100.2 


65.5 


C-7 


1 14.9 


75.0 


C-13 


43.1 


42.2 


C-i4 


46.7 


27.8 



H5ls36 yenome replication. Although one cell line (B-9) allowed H5ts36 replication to levels 
only 16 fold greater than LP-293 cells, this was the same cell line that was observed to 
display poor growth properties. Each of the remaining Ad polymerase-expressmg cell lines 
allowed substantially greater replication of H5ts36 at non-permissive temperatures, compared 
to LP-293 cells (Table 2). An enhancement of replication up to 75 fold above that of LP-293 
cells was observed with the cell line C-7 at 38.5°C. A substantially more rapid onset of viral 
cytopathic effect in Ad polymerase-expressing cell lines was observed at either temperature. 
These estimates of H5ts36 replication-complementation are conservative, since they have not 
been adjusted for the low level replication of H5ts36 at 38.5°C [Miller and Williams ( 1 987). 
supra]. The leakiness of the H5ts36 mutation could potentially be overcome with the use of a 
virus deleted for the Ad polymerase gene. 
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g) RNA Analysis Of Ad Poiymerase-Exprcssing Cell Lines 

Total RNA was extracted from each of the cell lines using the RNAzol method 
(Teltest. Inc.. Friendswood. TX 77546: Chomczynski and Sacchi (1987) Anal. Biochem. 
162.156]. Fifteen micrograms of RNA from each cell line was electrophoresed on a 0.8% 
agarose-formaldehyde gel. and transferred to a Nytran membrane (Schleicher & Schuell) by 
blotting. The filter was UV crosslinked. and analyzed by probing with the two "-P-labeled I 
kb Scu\ subfragments of Ad which span positions 6095-8105 of the Ad5 genome (see Figure 
I A). These two Seal subfragments of the Ad genome are complimentary to the 5' end of the 
Ad polymerase mRNA. The resulting autoradiograph is shown in Figure 4. In Figure 4. the 
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location of the smaller and larger species of Ad polymerase mKlNA are indicated relative to 
the 28S and 18S ribosomai RNAs. The aberrant transcript expressed by the B-9 cell line is 
indicated by an arrow. 

The results shown in Figure 4 revealed that the RNA derived from cell lines B-6 and 
5 C-7 contained two species of RNA. estimated to be -4800 and -7000 nt in length, while LP-293 
derived RNA had no detectable hybridization signal. The presence of two polymerase RNA 
species suggests that the polyadenylalion signal of the Ad IVa2 gene (present in the Ad-pol 
construct, see Fig. lA) is being utilized by the cell RNA processing machinery, in addition to 
the SV-40 polyadenylalion signal. Similar analysis of RNA derived from the cell lines C-1, 
10 C-4, C-13. and C-14 also detected the same two transcripts as those detected in the RNA of 
cell lines B-6 and C-7. but at decreased levels, suggesting that even low levels of Ad 
polymerase mRNA expression can allow for the efficient replication of polymerase mutants 
such as H5ls36. The cell line B-6 expressed high levels of polymerase transcript and can 
plaque H5is36 one day earlier than the other cell lines at 38.5°C. suggesting a causal 
15 relationship. It is interesting to note that the two polymerase transcripts are also detected in 
RNA isolated from cell line B-9, but substantial amounts of a larger RNA transcript (size > 
10 kb) is also present (see Figure 4). The high level production of the aberrant message may 
be related to the increased doubling time previously noted in this cell line. 

20 h) Transfectability Of Ad Polymerase-Exprcssing Cell Lines 

The ability of Ad polymerase-expressing 293 cell lines to support production of 
H5ts36 virions after transfection with H5ts36 genome DNA was examined as follows. 293 
cells as well as hygromycin resistant cell lines were grown to near confluency on 60 mm 
dishes and transfected with either 3 [xg of purified H5ts36 virion DNA, or with 3.5 |ig of the 
25 plasmid pFG140 (Microbix Biosystems). using the caiionic lipid Lipofectamine (Gibco-BRL). 

Cells that received the H5ts36 virion DNA were incubated at 32°C for 14 days, or 38.5°C for 
10 days. The pFG140 transfected cells were incubated at 37.5°C for 10 days. All plates 
were then stained with the neutral red agar overlay and plaques were counted the next day. 
The results are shown in Table 3. 
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TABLE 3 

Transtection Efnciency Of Ad pol-Expressiniz Cell Li 



Cell line 



LP-293 



B-6 



B-9 



C-l 



Number Of Plaques At: 



32.0°C 



>500 



-500 



n.d/ 



n.d. 



38.5°C 



-500 



n.d. 



-500 



10 



C-4 



C-7 



C-13 



C-14 



n.d. = not deiermined. 



n.d. 



>500 



n.d. 



>500 



n.d. 



100 



n.d. 



>500 



15 



The results shown in Table 3 demonsiraied that transfeciion of H5ts36 DNA at the 
non-permissive temperature allows for ample plaque production in all of the Ad polymerase- 
cxpressmg cell lines tested, unlike the parental LP-293 cells. Cell hne C-13 was at passage 
number 29. and demonstrated a somewhat decreased ability to generate plaques at this 
extended passage number. These same cell lines are also capable of producing plaques when 
transfecied with the plasmid pFGHO, a plasmid capable of producing mfectious. El 
dependent Ad upon transfection of the parental 293 cells [Ghosh-Choudhury (1986) Gene 
50:161]. These observanons suggest that the Ad polymerase expressmg cell lines should be 
useful lor the production of second generation Ad vectors deleted not only for the El genes, 
but also for the polymerase gene. As shown below in Examples 2 and 3, this is indeed the 
case. 



0 



EXAMPLE 2 

Isolation And Characterization Of Packaging Cell Lines That 
Coexpress The Adenovirus El. DNA Polymerase And Preterminal Proteins 

In E.xample 1. packaging ceil lines coexpressing Ad El and polymerase proteins were 
described. These cell lines were shown to support the replication and growth of H5ts36. an 
Ad with a temperature-sensitive mutation of the Ad polymerase protein. These polymerase- 
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expressme packaging"">Snl lines can be used to prepare Ad vebwf^ deleted tor the EI and 
polymerase functions. In this example. 293 cells cotransfecied with both Ad polvmerase and 
preterminal protein expression plasmids are characterized. Cell lines co-expressine the Ad EL 
polymerase and preterminal proteins can be used to prepare Ad vectors deleted for the El. 
polymerase and preterminal protein (pTP) functions. 

a) Tissue Culture And Virus Propagation 

The use of LP-293 cells (Microbix Biosystems Inc., Toronto). Ad-polvmerase 
expressing cell lines, and plaquing efficiency assays of Ad viruses was conducted as described 
in Example 1. All cells were maintained in 10% fetal bovine serum supplemented DMEM 
media (GIBCO) in the presence of antibiotics. The virus H5sub\00 [Freimuih and Ginsberg 
(1986) Proc. Natl. Acad. Sci. USA 83:7816] has a temperature sensitive (is) mutation caused 
by a three base pair insertion within the amino terminus of the preterminal protein, in addition 
to a deletion of the El sequences (see Figure IB). H5sub\00 was propagated and lilred at 
32.0°C in LP-293 cells: the leakiness of this stock was less than 1 per 1000 plaque-forming 
units (pfu) at the nonpermissive temperature of 38.5°C. A lower titer cell lysate contaning 
the virus H5//7 190 (which contains a !2 base-pair insertion wMthin the carboxy-terminus of the 
preterminal protein as well as a deletion of the El region, see Figure IB) was provided by Dr. 
P. Freimuth [Freimuth and Ginsberg (1986), supra]. The polymerase and preterminal protein 
expressing cell lines were always maintained in media supplemented with In gromycin 
(Sigma) at 100 |ig/mL. 

b) Isolation Of Ad Polymerase And Preterminal Protein 
Expressing 293 Cells 

The C-K C-4, C-7. C-13, and C-14 cell lines (Ex. 1), which had been coiransfected 
with pRSV-poi, pRSV-pTP and CEP4. were screened for presence of pTP sequences and for 
the ability to support the growth of H5ts36 (ts for the Ad-polymerase), H5//7190. and 
H5suh\00 using plaque assays as described in Example I. 
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i) ^Ralysis Of Genomic DNA And Cellular RNA 

Cell lines that had received the preterminal protein expression plasmid were screen tor 
the presence of pRSV-pTP sequences and El sequences. Total DNA was isolated from the 
LP-293. B-6 (transfected with pRSV-pol only) or C-7 (cotransfected with pRSV-pol and 
pRSV-pTP) cell lines, two micrograms ^g) of each DNA was codigested with the restriction 
enzymes Xhal and BamHl. electrophoretically separated in a 0.6% agarose gel. and 
transferred onto a nylon membrane. The membrane was UV crosslinked. probed with both a 
1.8 kb BM-.^Chal fragment (spans the El coding region) isolated from the plasmid pFGI40. 
and a 1.8kb EcoRy subfragmem of Ad serotype 5 (spans the preterminal protein coding 
sequences; see Fig. IB), both of which were random-primer radiolabeled w.th ^'P to a specific 
activity greater than 3.0 x 10" cpm/^g. The membrane was subsequently exposed to X-ray 
nim with enhancement by a fluorescent screen. The resulting autoradiograph is shown in 
Figure 5. 

In Figure 5. the preterminal specific sequences migrated as an -1 1.0 kb DNA fragment 
while the El containing band migrated as a 2.3 kb DNA fragment. No hybridization of either 
probe to DNA isolated from either the LP-293 or B-6 cell lines was observed As shown in 
Figure 5. only the C-7 cell line genomic DNA had preterminal coding sequences, unlike the 
parental LP-293 cells, or the Ad-polymerase expressing B-6 cells. In addition, all cell lines 
had El specific sequences present at nearly equivalent amounts, demonstrating that the 
selection design has not caused the loss of the EI sequences originally present in the LP-293 
cells. The results presented in Example I demonstrated that both the B-6 and C-7 cell lines 
contain polymerase specific sequences within their genomes, unlike the parental LP-293 cells. 

To confirm that transcription of preterminal protein was occurring, total RNA was 
isolated from each of the cell lines, transferred to nylon membranes, and probed to detect 
preterminal protein-specific mRNA transcripts as follows. Total cellular RNA was isolated 
from the respective cell lines and 15 ^ig of total RNA from each cell line was transferred to 
nylon membranes. The membranes were probed with the 1.8 kb EcoRV radiolabeled 
subfragment of Ad5 (see Fig. IB) complementary to the preterminal protein coding region. 
The resulting autoradiograph is shown in Figure 6. 

As shown in Figure 6. a single mRNA of the expected size (-3 kb in length) is detected 
only in RNA derived from the C-7 cell line. No hybridization was detected in lanes 
containing RNA derived from the LP-293 or B-6 cell lines. In Example 1. it was 
demonstrated that the C-7 cell line also expresses high levels of the Ad polymerase mRNA. 
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Thus, the C-7 cell linfe^nstitutiveiy expresses both the Ad pciT5^ierase and preterminal 
proiem niRNAs alone with El transcripts. 

ii) Piaquing Efficiency Of pTP Mutants On pTP- 
5 Expressing Cell Lines 

The C-7 cell line was screened for the ability to iranscomplemeni the gro\vth of 
preterminal mutant viruses. The virus H5/>;190 (contains a 12 base pair insenion located 
within the carboxy-terminus of the preterminal protein) has been shown to have a severe 
growth and replication defect, producing less than 10 plaque-forming units per cell [Freimulh 

10 and Ginsberg (1986) Proc. Natl. Acad. Sci. USA 83:7816]. The results are summarized in 

Table 4 below. For the results shown in Table 4, LP-293, B-6. or C-7 cells were seeded at a 
density of 2.0-2.5 x 10^ cells per plate. The cells were infected with limiting dilutions of 
lysates derived from the preterminal proiein-mutanl viruses H5//7 190 or H5.ii;6100. incubated 
at 38.5°C. and plaques counted after six days. As shown in Table 4. only the C-7 cell line 

15 could allow efficient plaque formation of H5/>7l90 at 38.5°C (the H5/>7l90 lysate used to 

infect the cells was of a low titer, relative to the high titer HSsuhXOO stock), while both the B- 
6 and C-7 cell lines had nearly equivalent piaquing efficiencies when H5.S7/6100 was utilized 
as the infecting virus. 

TABLE 4 

20 Piaquing Efficiency Of Preterminal Protein-Muiant Viruses 



Virus 


Mutation 
Location 


Plaque Titres (pfu/ml) 


LP-293 


B-6 


07 


H5^A7l90 


carboxy-ierminus 


<\ X 10' 


<1 X 10- 


1.4x1 0' 


H>suh\00 


amino-terminus 


<l X 10' 


9.0 X 10' 


4.5 X 10* 



25 As shown in Table 4. when equivalent dilutions of H5//?I90 were utilized, the piaquing 

efficiency of the C-7 cell line was at least 100-fold greater than that of the B-6 or LP-293 
cells. This result demonstrated that the C-7 cell line produces a functional preterminal 
protein, capable of trans-complementing the defect of the H5//2190 derived preterminal 
protein. 

30 The cell lines were next screened for the ability to trans-complement with the 

temperature-sensitive virus, H5.s'w6100, at nonpermissive temperatures, H5suh\Q0 has a codon 
insertion mutation within the amino-terminus of the preterminal protein, as well as an El 
deletion. The mutation is responsible both for a temperature sensitive growth defect, as well 
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as a replication defe^reimuih and Ginsberg (1986). w/v/^^Schaack ei at. (1995) j, 
Virol. 69:4079], The piaquing efficiency of the ceil line C-7 was found to be at least 1000 
fold greater than that of the LP-293 ceils (at non-perm.ssive temperatures) (see Table 4). 
Interestingly, the cell line B-6 was also capable of producing large numbers of H5sub\QQ 
derived plaques at 38.5°C. even though it does not express any preterminal protein. This 
result suggested that the high level expression of the polymerase protein was allowmg plaque 
formation of HSsi.b\m in the B-6 cell line. To examme this possibility, the nature of 
H5.vi/Al00 growth in the various cell lines was examined. 

c) Complementation Of The Replication And Growth Defects 
Of HS^mAIOO 

The cell lines B-6 and C-7 were shown to overcome the replication defect of H5ts36 
(Ex. 1 ). Since the preterminal and polymerase proteins are known to physically interact with 
each other [Zhao and Padmanabhan (1988) Ceil 55:1005]. we investigated whether the 
expres.sion of the Ad-polymerase could overcome the replication defect of HSsub\00 at non- 
permissiNe temperatures using the following replication-complementation assay. 

LP-293. B-6. or C-7 cells were seeded onto 60 mM dishes at a density of 2x10* cells 
per dish and infected the next day with H5iw6I00 at a multiplicity of infection (MOI) of 0.25, 
and incubated at 38.5''C for 16 hours or 32.0°C for 40 hours. The cells from each infected 
plate were then harvested and total DNA extracted as described in Example I. Four 
micrograms of each DNA sample was digested with HindUL elecirophoresed through a 0.7% 
agarose gel. transferred to a nylon membrane, and probed with '-P-labeled H5ts36 virion 
DNA. The resulting autoradiograph is shown in Figure 7. As seen in Figure 7. the 
HSstihXm replication defect when grown in LP-293 cells at 38.5°C is seen: this defect is not 
present when the virus is grown at the same temperature in either B-6 or C-7 cells. 

The results depicted in Figure 7 demonstrates that both cell lines B-6 and C-7 could 
trans-complement the replication defect of HSsuhlOQ. This result demonstrated that the 
expression of the Ad polymerase in B-6 cells was able to overcome the preterminal protein- 
mediated replication defect of HSiuAIOO. While not limiting the present invention to any 
particular mechanism, the ability of Ad polymerase to overcome the preterminal protein- 
mediated replication defect of H5sub\QQ may be due to a direct physical interaction of the 
polymerase with the amino-terminus of the H5ii/6l00-derived preterminal protein. 
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replication defect can be overcome when LP-293 cells were infected with a 100-fold tireaier 
amount of H5.S7/6I00. However, complementation of the H5suh\00 replication defect is not 
sufficient to overcome the growth defect o[ HSsublOO, since temperature shift-up experiments 
have demonstrated that the H5.v//^100 growth defect is not dependent upon viral replication 
[Schaack ei ciL (1995). supra]. Therefore, the overexpression of the Ad-polymerase must 
have allowed a very low level but detectable production of infectious H5.v///7lOO particles in 
the B-6 cell line. The reduced growth of HS^w/jIGO is therefore not due to a replication 
defect, but rather some other critical activitv that the preterminal protein has a role in, such as 
augmention of viral transcription by association with the nuclear matrix [Schaack and Shenk 
(1989) Curr. Top. Microbiol, Immunol. 144:185 and Hauser and Chamberlain (1996) J. Endo. 
149:373]. This was confirmed by assessing the ability of the C-7 cell line to overcome the 
growth defect of H5jz/6I00 utilizing one-step growth assays performed as follows. 

Each of the cell lines (LP-293. B-6 and C-7) were seeded onto 60 mm dishes at 2.0 \ 
10'' cells/dish. The cell lines were infected at an MOl of 4 with each of the appropriate 
viruses (wtAd5. H5ts36, or H5iz//} 100). and incubated at 38.5°C for 40 hours. The total 
amount of infectious virions produced in each 60 mm dish was released from the cell lysates 
by three cycles of freeze-thawing. and the titer was then determined by limiting dilution and 
plaque assay on B-6 cells at 38.5°C. The results are summarized in Figure 8. 

The results shown in Figure 8. demonstrated that even though the B-6 cell line allowed 
normal replication and plaque formation of H55//6100 at 38.5*'C (in fact. B-6 cells were 
utilized to determine the plaque tiires depicted in Figure 8) they could not allow high level 
growth of HSsuhXQQ and only produced titres of W5sub\00 equivalent to that produced by the 
LP-293 cells. The C-7 cell line produced 100 fold more virus than the LP-293 or B-6 cells, 
see Figure 8. Encouragingly, the titre of H5.yw6100 produced by the C-7 cells approached 
titres produced by LP-293 cells infected with wild-type virus. Ad5. When the \\Ssub\QQ 
virions produced from infection of the C-7 cells were used to infect LP-293 cells at 38,5^C. 
all virus produced retained the t:^ mutation {i.e,. at least a 1000 fold drop in pfu was detected 
when LP-293 cells were respectively infected at 38.5°C vs. 32.0°C). This finding effectively 
rules out the theoretical possibility that the \\5sub\QQ input virus genomes recombined with 
the preterminal protein sequences present in the C-7 cells. 

In addition, the C-7 cell line allowed the high level growth of H5ts36. demonstrating 
that adequate amounts of the Ad-polymerase protein were also being expressed. The C-7 cell 
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line was capable of tSrcompiementing the growih of both H!tr36 and H5.-,/^100 after 4 
months of serial passaging, demonstrating that the coexpression of the El. preterminal, and 
polymerase proteins was not toxic. 

These results demonstrate that the constitutive expression of both the polvmerase and 
preterminal proteins is not detrimental to normal virus production, which might have occurred 
if one or both of the proteins had to be expressed only during a narrow time period durine the 
Ad life cycle. In summary, these results demonstrated that the C-7 cell line can be used as a 
packaging cell line to allow the high level growth of El. preterminal, and polvmerase deleted 



Ad vectors. 



EXAMPLE 3 

Production Of Adenovirus Vectors 
Deleted For El And Polymerase Functions 

in order to produce an Ad vectors deleted for El and polymerase functions, a small, 
frame-shifting deletion was introduced into the Ad-pol gene contained within an EI -deleted 
Ad genome. The plasmid pBHGl 1 (Microbix) was used as the source of an El -deleted Ad 
genome. pBHGl 1 contains a deletion of Ad5 sequences from bp 188 to bp 1339 (0.5-3.7 
m.u.): this deletion removes the packagmg signals as well as El sequences. pBHGlI also 
contains a large deletion within the E3 region (bp 27865 to bp 30995: 77.5-86.2 m.u.). The 
nucleotide sequence of pBHGI I is listed m SEQ ID NO:4 [for cross-corrleat.on between the 
pBHGI 1 sequence and the Ad5 genome (SEQ ID NO: I), it is noted that nucleotide 8.773 in 
pBHGl 1 is equivalent to nucleotide 7.269 in Ad5]. 

pBHGl 1 was chosen to provide the Ad backbone because this plasmid contains a large 
deletion within the E3 region (77.5 to 86.2 m.u.) and therefore vectors derived from this 
plasmid permit the insertion of large pieces of foreign DNA. A large cloning capacity is 
important when the pol" vectors is to be used to transfer a large gene such as the dystrophin 
gene (cDNA = 13.6 kb). However, the majority of genes are not this large and therefore 
other Ad backbones containing smaller deletions within the E3 region (e.g.. pBHGlO which 
contains a deletion between 78.3 to 85.8 m.u.: Microb.x) may be employed for the 
construction of pol" vectors using the strategy outlined below. 
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a) ConstriKTfon Of A Plasmid Containing A PorriroTn Of The 

Adenovirus Genome Containing The Polymerase Gene 

A fragmeni of ihe Ad genome containing the pol gene located on pBHGl 1 was 
subcioncd to create pBSA-XB. Due to the large size of the Ad genome, this intermediate 
plasmid was constructed to facilitate the introduction of a deletion within the pol Lzene the pol 
deletion. pBSA-XB was constructed as follows. The polyiinker region of pBluescript 
(Stratagene) was modified to include additional restriction enzyme recognition sites (the 
sequence of the modified polyiinker is provided in SEQ ID NO:5: the remainder of 
pBluescript was not altered); the resulting plasmid was termed pBSX. pBHGll was digested 
with .Vbal and BamHl and the 20.223 kb fragment containing the pol and pTP coding regions 
(E2b region) was inserted into pBSX digested with .Vbal and BamHl to generate pBSA-XB. 

b) Construction Of pBHGIiApol 

A deletion was introduced into the pol coding region contained within pBSA-XB in 
such a manner that other key viral elements were not disturbed (e.g., the major late promoter, 
the tripartite leader sequences, the pTP gene and other leader sequences critical for normal 
virus viability). The deletion of the pol sequences was carried out as follows. pBSA-XB was 
digested with DspEl and the ends were filled in using T4 DNA polymerase. The BspEl- 
digested. T4 polymerase filled DNA was then digested with BamHl and the 8809 bp 
BamHl/ BspElifiWcd) fragment was isolated as follows. The treated DNA was run on a 0.6% 
agarose gel (TAE buffer) and the 8809 bp fragment was excised from the gel and purified 
using a QIAEX Gel Extraction Kit according to the manufacturer's instructions (OIAGEN, 
Chatsworth. CA). 

A second aliquot of pBSA-XB DNA was digested with BspHl and the ends were filled 
in with T4 DNA polymerase. The S.vpHI-digested. T4 polymerase filled DNA was then 
digested with BamHl and the 13.679 bp BamHl/ BspHlifiUQd) fragment was isolated as 
described above. 

The purified 8809 bp BamHl/ BspEUmi^d) fragment and the purified 13.679 bp 
BamHl/ BspHlifiUcd) fragment were ligated to generate pApol. pApol contains a 612 bp 
deletion within the pol gene (bp 8772 to 9385; numbering relative to that of pBHGll) and 
lacks the I 1.4 kb BamHl fragment containing the right arm of the Ad genome found within 
pBHGll. 
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To provide thKl^hl arm of the Ad genome. pApol was dit-ested with BamHl followed 
by treatment with calf intestinal alkaline phosphatase. pBHGI I was digested with BamH] and 
the I 1.4 kb fragment was isolated and purified using a QIAEX Gel Extraction Kit as 
described above. The purified 1 1.4 kb BamHl fragment was ligated to the 
Z?a/nHl/phosphatased pApol to generate pBHGllApol. Proper construction of pBHGI I Apol 
was confirmed restriction digestion (Hindlll). 

c) Rescue And Propagation Of AdSApol Virus 

The Ad genome contained within pBHGl lApol lacks the packaging signals. 
Therefore, in order to recover virus containing the 612 bp deletion within the pol gene from 
pBHGl 1 Apol. this plasmid must be cotransfected into packaging cells along with DNA that 
provides a source of the Ad packaging signals. The Ad packaging signals may be provided 
by wild-type or mutant Ad viral DNA or alternatively may be provided using a shuttle vector 
which contains the left-end Ad5 sequences including the packaging signals such as 
pAEIsplA. pAElspIB (Microbix) or pAdBglll (pAdBglll is a standard shuttle vector which 
contains 0-1 m.u. and 9-16 m.u. of the adenovirus genome). 

To rescue virus. pBHGl lApol was co-transfected with Ad5J/700l viral DNA . 
Ad5J/700I contains a deletion in the E3 region: the E3 deletion contained within Ad5c//700l 
is smaller than the deletion contained within the Ad genome contained within pBHGI 1. It is 
not necessary that Ad5 J/7001 be used to recover virus; other adenoviruses, including wild- 
type adenoviruses, may be used to rescue of virus from pBHGl I Apol. 

It has been reported that the generation of recombinant Ads is more efficient if Ad 
DNA-terminal protein complex (TPC) is employed in conjunction with a plasmid containing 
the desired deletion [Miyake ei ctl. (1996) Proc. Natl. Acad. Sci. USA 93:1320]. Accordingly. 
Ad5dl700l-TPC were prepared as described (Miyake cv al. (1996). supra]. Briefly, purified 
Ad5 J/7001 virions (purified through an isopycnic CsCl gradient centered at 1.34 g/ml) were 
lysed by the addition of an equal volume of 8 M guanidine hydrochloride. The released 
Ad5c//7001 DNA-TPC was then purified through a buoyant density gradient of 2.8 M CsCI/4 
M guanidine hydrochloride by centrifugation for 16 hr at 55.000 rpm in a VTi65 rotor 
(Beckman). Gradient fractions containing AdSJ/TOOI DNA-TPC were identified using an 
ethidium bromide spot test and then pooled, dialyzed extensively against TE buffer. BSA was 
then added to a final concentration of 0.5 mg/ml and aliquots were stored at -80°C. The 
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Ad5 J/7001 DNA-TPC was then digested with Seal and then gePfihered through a Sephadex 

G-50 spin column. 

One hundred nanograms of the digested Ad5t//7001 DNA-TPC was mixed with 5 |.m 
of pBHGl iApol and used to transfect pol-expressing 293 cells (i.e.. C-7). Approximately 10 
days post-transfection. plaques were picked. The recombinant viruses were plaque purified 
and propagated using standard techniques (Graham and Prevac. supra). Viral DNA was 
isolated, digested with restriction enzymes and subjected to Southern blotting analysis to 
determine the organization of the recovered viruses. Two forms of virus containing the 612 
bp pol deletion were recovered and termed Ad5ApolAE3I and Ad5ApolAE3II. One form of 
recombinant pol' virus recovered, Ad5ApolAE3L underwent a double recombination event 
with the Ad5t//7001 sequences and contains the E3 deletion contained within Ad5 J/7001 at 
the right end of the genome. The second form of recombinant pel' virus. Ad5ApolAE3II. 
retained pBHGl 1 sequences at the right end of the genome (i.e.. contained the E3 deletion 
found within pBHGl 1). These results demonstrate the production of a recombinant Ad vector 
containing a deletion within the pol gene. 

d) Characterization Of The E1+, pol- Viruses 

To demonstrate that the deletion contained within these two pol" viruses renders the 
virus incapable of producing functional polymerase. Ad5ApolAE3! was used to infect 293 
cells and pol-expressing 293 cells (B-6 and C-7 cell lines) and a viral replication- 
complementation assay was performed as described in Example le. Briefly. 293. B-6 and C-7 
cells were seeded onto 60 mm dishes at a density of 2 x 106 cells/dish and infected with 
H5siih\00 or Ad5ApolAE3I at an MOl of 0.25. The infected cells were then incubated at 
37''C or 38.5'^C for 16 hours, or at 32.0°C for 40 hours. Cells from each infected plate were 
then harvested and total DNA was extracted. Four micrograms of each DNA sample was 
digested with HindlU, eiectrophoresised through an agarose gel. transferred to a nylon 
membrane and probed with '-P-labeled adenoviral DNA. The resulting autograph is shown in 
Figure 9. In Figure 9, each panel shows, from left to right. DNA extracted from 293, B-6 
and C-7 cells, respectively infected with either U5si4b\00 or Ad5ApolAE31 (labeled 
AdSAPOL in Fig. 9). 

As shown in Figure 9, the recombinant pol" virus was found to be viable on pol- 
expressing 293 cells but not on 293 cells. These results demonstrates that recombinant Ad 
viruses containing the 612 bp deletion found within pApol lack the ability to express Ad 
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polymerase. These re^s also demonstrate that B-6 and C-7 cells etncently complement the 
Apol found within Ad5ApolAE3I and Ad5ApolAE3Il. in addition, these results show that 
replication of the ts pTP mutant HSsublOO can be complemented by high level expression of 
the Ad polymerase with or without co-expression of pTP. 

Because the expression of early genes is required for the expression of the late gene 
products, the ability of recombinam viruses contaning the Apol deletion to direct the 
expression of late gene products was examined. 293 and C-7 cells were infected with 
Ad5ApolAE31 and 24 hours after infection cell extracts were prepared. The cell extracts were 
serially diluted and examined for the expression of the fiber protein (a late gene product) by 
immunoblot analysis. The immunoblot was performed as described in Example 3C with the 
exception that the primary antibody used was an anti-fiber antibody (FIBER-KNOB obtained 
from Robert Gerard. University of Texas Southwestern Medical School). The results of this 
immunoblotting analysis revealed that no fiber protein was detected from 293 cells infected 
with Ad5ApolAE3l (at any dilutuion of the cell extract). In comrast. even a 1:1000 dilution 
of cell extract prepared from C-7 cells infected with Ad5ApolAE31 produced a visible band 
on the immunoblot. Therefore, the pol deletion contained within Ad5ApolAE3I resulted in a 
greater than 1 000- fold decrease in fiber production. 

The above results demonstrate that polymerase gene sequences can be deleted from the 
virus and that the resulting deleted virus will only grow on cells producing Ad-polymerase ,n 
inim. Using the pol-expressing cell lines described herein {c.^.. B-6 and C-7). large 
quantities of the pol" viruses can be prepared. A dramatic shut-down in growth and late gene 
expression is seen when cells which do not express Ad polymerase are infected with the pol" 



viruses. 



e) Generatioa Of El', Pol" Ad Vectors 

Ad5 J/7001 used above to recuse virus containing the polymerase deletion is an El- 
containing virus. The presence of El sequences on the recombinant pol" viruses is 
undesirable when the recombinant virus is to be used to transfer genes into the tissues of 
animals: the El region encodes the transforming genes and such viruses replicate extremely 
well in vivo leading to an immune response directed against cells infected with the El- 
containing virus. 

EI - viruses containing the above-described polymerase deletion are generated as 
follow.s. pBHGl lApol is cotransfected into pol-expressing 293 cells {e.f;.. B-6 or C-7) along 
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vviih a shuttle vector containing the left-end Ad5 sequences including ihe packaging signals. 
Suitable shuttle vectors include pAElspI A (Microbix). pAElsplB (Microbix) or pAdBglll. 
The gene of interest is inserted into the poiylinker region of the shuttle vector and this 
plasmid is then cotransfecied into B-6 or C-7 cells along with pBHGl 1 Apol lo generate a 
recombinant EP. pol" Ad vector containing the gene of interest. 



EXAMPLE 4 
Production Of Adenovirus Vectors 
Deleted For El And Preterminal Protein Functions 



In order to produce an Ad vectors deleted for El and preterminal protein functions, a 
small deletion was introduced into the Ad preterminal protein (pTP) gene contained within an 
El -deleted Ad genome. The plasmid pBHGl I (Microbix) was used as the source of an El- 
deleted Ad genome to maximize the cloning capacity of the resulting pTP" vector. However, 
other Ad backbones containing smaller deletions within the E3 region (e.^,, pBHGlO which 
contains a deletion between 78.3 to 85.8 m.u.; Microbix) may be employed for the 
construction of pTP" vectors using the strategy outlined below. 

a) Construction Of pApTP 

A deletion w^as introduced into the pTP coding region contained within pBSA-XB (Ex. 
3) in such a manner that other key viral elements were not disturbed (e.g., the tripartite leader 
sequences, the i-leader sequences, the VA-RNA 1 and II genes, the 55 kD gene and the pol 
gene). The deletion of the pTP sequences was carried out as follows. pBSA-XB was 
digested with Xbcd and EcoRV and the 7.875 kb fragment was isolated as described (Ex. 3). 
Another aliquot of pBSA-XB was digested with Miml and the ends were filled in using T4 
DNA polymerase. The /V/w/?! -digested, T4 polymerase filled DNA was then digested with 
Xbal and the 14.894 kb Xbal/MunlifiWed) fragment was isolated as described (Ex. 3). The 
7.875 kb :Vhm\ fragment and the 14.894 kh Xha\/Miml{f\\\Qd) fragment were ligated together 
to generate pApTP. 
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b) Construction Of pBHG II ApTP 

pApol contains a 429 bp deletion within the pTP gene (bp 10.705 to 1 1.134; 
numbering relative to that of pBHGil) and lacks the 11 .4 kb ^^mrHI fragment containmg the 
right arm of the Ad genome found within pBHGl I. 

To provide the right arm of the Ad genome. pApTP was digested with BamHl 
followed by treatment with shrimp alkaline phosphatase (SAP: U.S. Biochemicals. Cleveland. 
OH). pBHGI 1 was digested with BamHl and the 1 1.4 kb fragment was isolated and purified 
using a QIAEX Gel Extraction Kit as described (E.x. 3). The purified 1 1.4 kb DamH\ 
fragment was ligated to the 5amHI/phosphatased pApTP to generate pBHGl lApTP. Proper 
construction of pBHGl 1 ApTP was confirmed restriction digestion. 

c) Rescue And Propagation Of Ad Vectors Containing The pTP 
Deletion 

The Ad genome contained within pBHGl lApTP lacks the Ad packaging signals. 
15 Therefore, in order to recover virus containing the 612 bp deletion within the pol gene from 
pBHGl lApol. this plasmid must be cotransfected into packaging cells along with DNA that 
provides a source of the Ad packaging signals. The Ad packaging signals may be provided 
by wild-type or mutant Ad viral DNA or alternatively may be provided using a shuttle vector 
which contains the left-end Ad5 sequences including the packaging signals such as 
20 pAElsplA. pAElsplB (Microbix) or pAdBglll. 

Recombinant Ad vectors containing the pTP deletion which contain a deletion within 
the E3 region are generated by cotransfection of pBHGl lApTP (the gene of interest is 
inserted into the unique Pad site of pBHGIlApTP) with a E3-deleted Ad virus such as 
Ad5dl700 into pTP-expressing 293 cells (e.g.. C-7); viral DNA-TPC are utilized as described 
25 . above in E.xampie 3. 

Recombinant vectors containing the pTP deletion which also contain deletions within 
the El and E3 regions are generated by cotransfection of pBHGllApTP into pTP-expressing 
293 cells {e.g.. C-7) along with a shuttle vector containing the left-end Ad5 sequences 
including the packaging signals. Suitable shuttle vectors include pAElsplA (Microbix). 
pAElsplB (Microbix) or pAdBglll. The gene of interest is inserted into the polylinker region 
of the shuttle vector and this plasmid is then cotransfected into B-6 or C-7 cells along with 
pBHGl lApTP to generate a recombinant El". pTP" Ad vector containing the gene of 
interest. 
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EXAMPLE 5 

Production Of Adenovirus Vectors Deleted For 
El. Polvmerase And Preterminal Protein Functions 



In order to produce an Ad vectors deleted for El. polymerase and preterminal protein 
functions, a deletion encompassing pol and pTP gene sequences was introduced into the Ad 
sequences contained within an El -deleted Ad genome. The plasmid pBHGI 1AE4 was used 
as the source of an El -deleted Ad genome to maximize the cloning capacity of the resulting 
pol". pTP" vector. pBHGl IAE4 is a modified form of BHGl 1 which contains a deletion of 
all E4 genes except for the E4 ORJ 6; the E4 region was deleted to create more room for the 
insertion of a gene of interest and to further disable the virus. However, other El" Ad 
backbones, such as pBHGll and pBHGl (Microbix; pBHGlO contains a smaller deletion 
within the E3 region as compared to pBHGl 1). may be employed for the construction of 
pol". pTP" vectors using the strategy outlined below. 

Due to the complexity of the cloning steps required to introduce a 2.3 kb deletion that 
removes portions of both the pTP and pol genes, this deletion was generated using several 
steps as detailed below. 

a) Construction Of pAXBApolApTPVARNA+tl3 

In order to create a plasmid containing Ad sequences that have a deletion within the 
pol and pTP genes, pAXBApolApTPVARNA+tl3 was constructed as follows. 

pBSA-XB was digested with BspEl and the 18 kb fragment was isolated and 
recircularized to create pAXBApolApTP; this plasmid contains a deletion of the sequences 
contained between the RvpEI sites located at 8.773 and 12.513 (numbering relative to 
pBHGll). 

A fragment encoding the VA-RNA3 sequence and the third leader of the tri-partite 
leader sequence was prepared using the PGR as follows. The PGR was carried out in a 
solution containing H5ts36 virion DNA (any Ad DNA. including wild-type Ad. may be used), 
2 ng/mL of primers 4005E and 4006E. 10 mM Tris HCl, pH 8.3. 50 mM KGl. K5 mM 
MgCK. 0.001% gelatin and Pfu polymerase. The forward primer. 4005E, [5'- 
TGCCGCAGCACCGGATGGATC-3^ (SEQ ID NO:6)| contains sequences complementary to 
residues 12.551 to 12.571 of pBHGll (SEQ ID NO:4). The reverse primer. 4006E, [5*- 
GCG TCGGGA GGCTGCCATG CGGCAGGG-3' (SEQ ID NO:7)] is complementary to 
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residues 1 1 .09 1 to 1 1 . 1 08 ot pBHGIl fSEQ ID NO:4) as well as a 5,s/;EI site (underlined). 
The predicted sequence of the kb PCR product is listed in SEQ ID NI0:8. 

PCR was performed with a Peri<in Elmer 9600 Thermocycler utilizing the tbllowmg 
cycling parameters: m.t.al denaturaiion at 94°C for 3 mm. .> cycles of denaturation at 94°C 
for 30 sec. annealing at 50°C for 30 sec. and extension at 72°C for 60 sec. followed by 
another 27 cycles whh an increased annealing temperature at 56'C. with a fmal extension at 
72°C for 10 minutes. The -1.6 kb PCR product was purified using a QlAEX Gel Extraction 
Kit as described (Ex. 3). 

The purified PCR fragment was then digested with BspEl. pAXBApolApTP was 
digested with BspEl followed by treatment with SAP. The fi.;7E I -digested PCR fragment and 
the RvpEI-SAP-treated pAXBApolApTP were ligated together to create 
pAXBApol ApTPV ARN A+t 1 3. 

b) Construction Of pBHG 1 1 ApolApTPVARNA+tl3 

pAXBApolApTPVARNA+tl3 contains a 2.3 kb deletion within the pol and pTP genes 
(bp 8.773 to 11. 091: numbering relative to that of pBHGll) and lacks the I 1 .4 kb BamHl 
fragment containing the right arm of the Ad genome. 

To provide the right arm of the Ad genome. pAXBApol ApTPV ARNA+t 1 3 was 
digested with BamHl followed by treatment with SAP. pBHGl 1AE4 (pBHGl 1 or pBHGIO 
may be used in place of pBHGl IAE4) was digested with BamHl and the 1 1.4 kb fragment 
was isolated and purified using a QlAEX Gel E.xtraction Kit as described (Ex. 3). The 
purified 1 1.4 kb BamHl fragment was ligated to the Sa/wHI/phosphatased 
pAXBApolApTPVARNA+tl3 to generate pBHGl 1 ApolApTPVARJ^A+tl3. Proper 
construction of pBHGl lApolApTPV ARNA+t 13 was confirmed restriction digestion. 

c) Rescue And Propagation Of Ad Vectors Containing The pol, 
pTP Double Deletion 

The Ad genome contained within pBHGl lApol ApTPV ARNA+t 13 lacks the Ad 
packaging signals. Therefore, in order to recover virus containing the 2.3 kb deletion within 
the pol and pTP genes from pBHGI 1 ApolApTPVARMA+tl 3. this plasmid must be 
cotransfected into packaging cells along with DNA that provides a source of the Ad 
packaging signals. The Ad packaging signals may be provided by wild-type or mutant Ad 
viral DNA or alternatively may be provided using a shuttle vector which contains the left-end 
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Ad5 sequences including the packaging signals such as pAElspTA. pAElsplB (Microbix) or 
pAdBglll. 

Recombinant vAd vectors containing the pol. pTP double deletion which also contain a 

deletion within the E3 region are generated by cotransfection of 
5 pBHGl 1 ApolApTPVARNA-+-tl3 (the gene of interest is inserted into the unique Pad site of 

pBHGI lApolApTPVARNA+tl3) with a E3-deleled Ad virus such as Ad5t//7001 into pol- and 

pTP-expressing 293 cells (e.g.. C-7); viral DNA-TPC are utilized as described (Ex. 3). 

Recombinant vectors containing the pol, pTP double deletion which also contain 

deletions within the El and E3 regions are generated by cotransfection of 
!0 pBHGl 1 ApolApTPVARNA+tl3 into pol- and pTP-expressing 293 cells (e.g.. C-7) along with 

a shuttle vector containing the left-end Ad5 sequences including the packaging signals. 

Suitable shuttle vectors include pAEIsplA (Microbix), pAElsplB (Microbix) or pAdBglll. 

The gene of interest is inserted into the polylinker region of the shuttle vector and this 

plasmid is then cotransfected into B-6 or C-7 cells along with 
15 pBHGl 1 ApolApTPVARNA+tl3 to generate a recombinant El", pol pTP~ Ad vector 

containing the uene of interest. 



EXAMPLE 6 

Encapsidated Adenovirus Minichromosomes 
20 Containing A Full Length Dystrophin cDNA 



In this Example, the construction of an encapsidated adenovirus minichromosome 
(EAM) consisting of an infectious encapsidated linear genome containing Ad origins of 
replication, packaging signal elements, a p-galactosidase reporter gene cassette and a full 

25 length (14 kb) dystrophin cDNA regulated by a muscle specific enhancer/promoter is 

described. EAMs are generated by cotransfecting 293 cells with supercoiled plasmid DNA 
(pAd5pdys) containing an embedded inverted origin of replication (and the remaining above 
elements) together with linear DNA from El -deleted virions expressing human placental 
alkaline phosphatase (hpAP). All proteins necessary for the generation of EAMs are provided 

30 in iram from the hpAP virions and the two can be separated from each other on equilibrium 
CsCI gradients. These EAMs are useful for gene transfer to a variety of cell types both in 
vifro and in vivo. 
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a) Generation And Propagation Of Encapsidated Adenovirus 
iVlinichromosomes 

To establish a vector system capable of delivering full length dystrophin cDNA clones, 
the minimal region of Ad5 needed for replication and packaging was combined with a 
conventional plasmid carrying both dystrophin and a P-galactosidase reporter cene. In the 
first vector constructed, these elements were arranged such that the viral ITRs tlanked (ITRs 
facing outward) the reporter gene and the dystrophin gene \i.c.. the sector contained from S' 
to 3- the right or 3" ITR (mu 100 to 99). the dystrophin gene and the reporter gene and the 
left or 5- ITR and packaging sequnce (mu I to 0)). Upon introduction of this vector along 
with helper virus into 293 cells, no encapsidated adenovirus minichromosomes were 
recovered. 

The second and successful vector. pAd5pdys (Fig. 10) contains 2.1 kb of adenovirus 
DNA. together with a 14 kb murine dystrophin cDNA under the control of the mouse muscle 
creatine kinase enhancer/promoter, as well as a p-galactosidase gene regulated by the human 
cytomegalovirus enhancer/promoter. 

Figure 10 shows the structure of pAd5pdys (27.8 kb). The two inverted adenovirus 
origins of replication are represented by a left and right inverted terminal repeat 
(LITRyRJTR). Replication from these ITRs generates a linear genome whose termini 
correspond to the 0 and 100 map unit (mu) locations. Orientation of the origin with respect 
to wild type adenovirus serotype 5 sequences in mu is indicated above the figure (1 mu=360 
bp). Fncapsidation of the mature linear genome is enabled by five (Al-AV) packaging signals 
(4^). The E. tali P-galactosidase and mn.s musculus dystrophin cDNAs are regulated by 
cytomegalovirus (CMV) and muscle creatine kinase (MCK) enhancer/promoter elements, 
respectively. Both e.xpression cassettes contain the SV40 polyadenylation (pA) signal. Since 

. the El A enhancer/promoter overlaps with the packaging signals. pAd5pdys was engineered 
such that RI^A polymerase transcribing from the El A enhancer/promoter will encounter the 
SV40 late polyadenylation signal. Pertinent restriction sites used in constructing pAdSpdys 

_are indicated below the figure. An adenovirus fragment corresponding to mu 6.97 to 7.77 
was introduced into pAdSpdys during the cloning procedure (described below). PI and P2 
represent location of probes used for Southern blot analysis. Restriction sites destroyed 
during the cloning of the Ad5 origin of replication and packaging signal are indicated in 
parentheses. 
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pAd5pdys was constructed as follows. pBSX (Ex. 3a) was used as the backbone tor 
construction of pAd5(3dys. The inverted fused Ad5 origin of replication and five 
encapsidaiion signals were excised as a PstUXbal fragment from pAd5ori. a plasmid 
containing the 6 kb HindlU fragment from pFG!40 (Microbix Biosystems). This siratecy also 
introduced a 290 bp fragment from Ad5 corresponding to map units 6.97 to 7.77 adjacent to 
the right inverted repeat (see Fig. 10). The E. coli p-galactosidase gene regulated by the 
human CMV immediate early (IE) promoter/enhancer expression cassette was derived as an 
EcoRl/HindlU fragment from pCMVp [MacGregor and Caskey (1989) Nucleic Acids Res. 
17:2365]: the CMV IE enhancer/promoter is available from a number of suppliers as is the E. 
coli P-galaciosidase gene]. The murine dystrophin expression cassette was derived as a 
BssHW fragment from pCVAA. and contains a 3.3 kb MCK promoter/enhancer element 
[Phelps ei al. (1995) Hum. Mol. Genet. 4:1251 and Jaynes ef ai (1986) MoL Cell. Biol. 
6:2855]. The sequence of the -3.3 kb MCK promoter/enhancer element is provided in SEQ ID 
N0:9 (in SEQ ID NO:9. the last nucleotide of SEQ ID NO:9 corresponds to nucleotide +7 of 
the MCK gene). The enhancer element is contained within a 206 bp fragment located -1256 
and -1050 upstream of the transcription start site in the MCK gene. 

It was hoped that the pAdSpdys plasmid would be packageable into an encapsidated 
minichromosome when grown in parallel with an EI -deleted virus due to the inclusion of both 
inverted terminal repeats (ITR) and the major Ad packaging signals in the plasmid. The ITRs 
and packaging signals were derived from pFGHO (Microbix). a plasmid that generates El- 
defective Ad particles upon transfection of human 293 cells. 

hpAP is an El deleted Ad5 containing the human placental alkaline phosphatase gene 
[Muller et ai (1994) Circ. Res. 75:1039]. This virus was chosen to provide the helper 
functions so that it would be possible to monitor the titer of the helper virus throughout serial 
passages by quantitative alkaline phosphatase assays. 

293 cells were cotransfected with pAdSpdys and hpAP DNA as follows. Low passage 
293 cells (Microbix Biosystems) were grown and passaged as suggested by the supplier. Five 
pAd5pdys and hpAP DNA (5 and 0.5 |dg, respectively) were dissolved in 70 ^1 of 20 mM 
HEPES buffer (pH 7.4) and incubated with 30 |il of DOTAP (BMB) for 15 min. at room 
temperature. This mixture was resuspended in 2 mis of DMEM supplemented with 2% fetal 
calf serum (PCS) and added dropwise to a 60 mm plate of 293 cells at 80% confluency. Four 
hours post-transfeciion the media was replaced by DMEM with 10% PCS. Cytopathic effect 
was observed 6-12 days posi-iransfection. 
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Cotransteccion of 293 cells with supercoiied pAd5pdys and linear hpAP DNA 
produced Ad5pdys EAMs (the encapsidated version of pAdipdys) approximately 6 to 12 days 
post iransfection. as evidenced by the appearance of a cyiopathic effect (CPE). Initially, the 
amount oi Ad5pdys EAMs produced was significantly lower than that of hpAP virions so the 
V iral suspension was used to re-.nfect fresh cultures, from which virus was isolated and used 
for .serial infection of additional cultures (Fig. 1 1 ). Infection and serial passaging of 293 cells 
was carried out as follows. 

Lysate from one 60 mm plate of transfected 293 cells was prepared by vigorously 
washing the cells from the plate and centrifuging at 1 K rpm in a clinical centrifuge. Cells 
were resuspended in DMEM and 2% PCS. freeze-thawed in a dry ice-ethanol bath, ceil debris 
removed by centrifugation. and approximately 75% of the crude lysate was used to infect 293 
cells in DMEM supplemented with 2% PCS for 1 hour and then supplemented with 10% PCS 
thereafter. Infection was allowed to proceed for 18-20 hrs before harvesting the virus. The 
total number of cells infected in each serial passage is indicated in Pigure II. 

in Pigure 1 1, the total number of transducing adenovirus particles produced (output) 
per serial passage on 293 cells, total input virus of either the helper (hpAP) or .Ad5pdys, and 
the total number of cells used in each infection is presented. The total number of input/output 
transducing particles were determined by infection of 293 cells plated in 6-weil microtiter 
plates. Twenty four hours post-infection the cells were assayed for alkaline phosphatase or p- 
galactosidase activity (described below) to determine the number of cells transduced with 
either .Ad5Pdys or hpAP. The number of transducing particles were estimated by 
extrapolation of the mean calculated from 31 randomly chosen 2.5 mm- sectors of a 961 mm' 
plate. The intra-sector differences in total output of each type of virus are presented as the 
standard deviations, a. in Figure 11. For each serial passage. 75% of the total output virus 
from the previous passage was used for infection. 

Alkaline phosphatase or p-galactosidase activity was determined as follows. For 
detection of alkaline phosphatase, infected 293 cells on Petri dishes were rinsed twice with 
phosphate buffered saline (PBS) and fixed for 10 minutes in 0.5% glutaraldehyde in PBS. 
Cells were again rinsed twice with PBS for ten minutes followed by inactivation of 
endogenous alkaline phosphatase activity at 65°C for I hr. in PBS prior to the addition of the 
chromogenic substrate BCIP (5-bromo-4-chloro-3-indolyl phosphate) at 0.15 mg/ml and nitro 
blue tetrazolium at 0.3 mg/ml) . Cells were incubated at 37°C in darkness for 3- 24 hrs. For 
P-calactosidase assays, the cells were fixed and washed as above, then assayed using standard 
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methods [MacGregor ei uL (1991) In Murray (ed.). Methods in Molecular Biology, Vol. ~: 
Gene Transfer and Expression Protocols, Human Press Inc.. Clifton, NJ. pp. 217-225]. 

As shown in Figure 11. the rate of increase in tiler of Adifidys EAMs between 
transfeciion and serial passage 6 was approximately 100 times greater than that for hpAP 
virions. This result indicated that AdSpdys has a replication advantage over the helper virus, 
probably due to the shorter genome length (a difference of approximately 8 kb) and hence an 
increased rate of packaging. 

Interestingly, after serial passage 6 there was a rapid decrease in the total titer of hpAP 
virions whereas the titer of AdSpdys EAMs continued to rise. While not limiting the present 
invention to any particular mechanism, at least two possible mechanisms could be responsible 
for this observation. Firstly, a buildup of defective hpAP virions due to infections at high 
multiplicities may slowly out-compete their full length counterparts, a phenomenon that has 
been previously observed upon serial propagation of adenovirus [Daniel! (1976) J. Virol. 
19:685: Rosenwirth et al. (1974) Virology 60:431; and Burlingham et al. (1974) Virology 
60:419]. Secondly, the emergence of replication competent virions due to recombination 
events between El sequences in cellular DNA and the hpAP genome could lead to a buildup 
of virus particles defective in expressing alkaline phosphatase [Lochmiiller et al. (1994) Hum. 
Gene Ther. 5:1485], 

Southern analysis of DNA prepared from serial lysates 3. 6. 9 and 12 indicated that 
full length dystrophin sequences were present in each of these lysates (Fig. 12A). In addition, 
the correct size restriction fragments were detected using both dystrophin and p-galactosidase 
probes against lysate DNA digested with several enzymes (Figs. 12A-B). 

Figure 12 shows a Southern blot analysis of viral DNA from lysates 3. 6. 9 and 12, 
digested with the restriction enzymes 5a\sHII. ,VrwI and EcoRV, indicating the presence of a 
full length dystrophin cDNA in all lysates. Fragments from the C terminus of mtts musculus 
dystrophin cDNA (A) or the N terminus of E coli p-galactosidase (B) were labeled with 
dCTP'- and used as probes [Sambrook et oL, supra]. The position of these probes and the 
predicted fragments for each digest is indicated in Figure 10. Note that one end of each 
fragment (except the 17.8 kb BssHll dystrophin fragment) detected is derived from the end of 
the linearized AdSpdys genome (see Fig. 10). Low levels of shoner products, presumably 
derived from defective virions, become detectable only at high serial passage number. 

.At the later passages (9 and 12) there appeared to be an emergence of truncated 
Ad5pdys sequences, suggesting that deletions and/or rearrangements may be occurring at later 
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passages. Hence, most experiments were performed with Adipdys EAMs derived from the 
earlier passages. The possibility of the emergence of replication competent (El-containinu) 
viruses was also examined by infection of HeLa ceils by purified and crude serial lysates. 
none of which produced any detectable CPE. 

5 

b) Purification Of Encapsidated Adenovirus Minichromcsomes 
On CsCI Gradients 

The ability to separate AdSPdys EAMs from hpAP virions based on their buoyancy 
difference (due presumably to their different genome lengths) on CsCl gradients was 

10 examined. Repeated fractionation of the viral lysate allows small differences in buoyancy to 
be resolved. CsCl purification of encapsidated adenovirus minichromosomes was performed 
as follows. Approximately 25% of the lysate prepared from various passages during serial 
infections was used to purify virions. Freeze-thawed lysate was centrifuged to remove the 
cell debris. The cleared lysate was extracted twice with 1.1.2 tricholorotrifluoroethane 

15 (Sigma) and applied to CsCl step and self forming gradients. 

Purification of virus was initially achieved by passing it twice through CsCl step 
gradients with densities of p=1.45 and p=1.20 in a SW28 rotor (Beckman). After isolation of 
the major band in the lower gradient, the virus was passed through a self forming gradient 
(initial p=1.334) at 37.000 rpm for 24 hrs followed by a relaxation of the gradient by 

20 reducing the speed to 10.000 rpm for 10 hrs. in a SW4I rotor (Beckman) at !2°C [Anet and 
Strayer (1969) Biochem. Biophys. Res. Commun. 34:328]. The upper band from the gradient 
(composed mainly of Ad5pdys virions) was isolated using an 18 gauge needle, reloaded on a 
fourth CsCl gradient (p=1.334) and purified at 37.000 rpm for 24 hrs followed by 10.000 rpm 
for 10 hrs at 12°C. 

. The AdSpdys-containinc CsCl band was removed in 100 ^1 fractions from the lop of 

the centrifugation tube and CsCl was removed by chromatography on Sephadex G-50 
(Pharmacia). Aliquots from each fraction were used to infect 293 cells followed by P- 
, galactosidase and alkaline phosphatase assays to quantitate the level of contamination by 
hpAP virions in the final viral isolate. 

Results of the physical separation between Ad5pdys EAMs and hpAP virions are 
shown in Figure 13. Figure 13 shows the physical separation of AdSpdys from hpAP virions 
at the third (A) and final (B) stages of CsCl purification (initial p=l.334) in a SW4I tube. 
Aliquots of AdSpdys EAMs from the final stage were drawn through the top of the 
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centril'uijaiion tube anuassayed for (i-galaciosidase and alkaline phosphatase expression. 

Results of these assays are presented in Figure 14. 

In Figure 14. shows the level of contamination of Ad5pdys EAMs by hpAP virions 

obtained from passage 6, Following four cycles of CsCl purificaiion. aliquots were removed 
5 from the top of the centrifugation tube and used for infection of 293 cells, which were fixed 

24 hrs later and assayed for P-galactosidase or alkaline phosphatase activity (described below). 

The number of transducing particles are an underestimation of the actual totals as in some 

cases a positive cell may have been infected by more than one transducing panicle. The ratio 

of the two types of virions - AdSpdys EAMs (LacZ) or hpAP (AP) in each fraction is 
10 indicated in the lower graph. 

The maximum ratio of transducing AdSpdys EAMs to hpAP virions reproducibly 

achieved in this study was 24.8 a contamination with helper virus corresponding to 

approximately 4% of the final viral isolate. 

15 c) Dystrophin And iS-galactosidase Expression By Encapsidated 

Adenovirus Minichromosomes In Muscle Cells 

To determine if the AdSpdys EAMs were able to express P-galactosidase and 
dystrophin in muscle cells, mouse nuLx myogenic cultures were infected with CsCl purified 
EAMs. 

20 

i) Propagation And Infection Of Muscle Cells 

MM 14 and mdx myogenic cell lines were kindly provided by S. Hauschka (University 
of Washington) and were cultured as previously described [Linkharl ei ctl. (1981) Dev. Biol. 
86:19 and Clegg et al. (1987) J. Cell BioL 105:949]. Briefly, myoblasts were grown on 

25 plastic tissue culture plates coated with a 0.1% gelatin in Ham's F-10 medium containing 15% 
(v/v) horse serum, 0.8 mM CaCK. 200 ng/ml recombinant human basic fibroblast growh 
factor (b-FGF) and 60 |ig/ml genitimicin (proliferation medium). Cutures were induced to 
differentiate by switching to growth in the presence of growth medium lacking b-FGF and 
containing 10% horse serum (differentiation medium). Myoblasts or differentiated myotubes 

30 (three days post switching) were infected at a multiplicity of infection of 2.2 Ad5pdys EAMs 
per cell. Fractions containing minimal contamination with hpAP virions (3. 4 and 5 of 
passage 6) were used for western and immunofluorescence analysis. Infection was allowed to 
proceed for 3 days for both the myoblasts and myotubes before harvesting cells. 
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ii) ^^al Protein Extraction And Immunobiot 
Analysis 

For protein extraction, muscle cells were brietly trypsinized. transferred to a 
microcentrifuge tube, centrifuged at 14 K for 3 min at room temp and resuspended two times 
in PBS. After an additional centrifugation. the cell pellet was resuspended in 80 ul of RIPA 
buffer (50 miM Tris-Cl. pH 7.5: 150 mM NaCl: 1% Nonidei P-40: 0.5% sodium 
deoxycholate: 0.1% SDS) (Sambrook ei ai. supra). The sample was briefly sheared using a 
22 gauge needle to reduce viscosity and total protein concentration assayed using the 
bic.nconinic acid protein assay reagent (Pierce. Rockford. IL). E.xpression of full length 
dystrophin or P-galactosidase in infected mdx and MM 1 4 myoblasts or myotubes was 
analyzed by electrophoresis of 40 of total protein extract on a 6% SDS-PAGE gel (in 25 
mM Tris. 192 mM glycine. 10 mM 3-mercaptoethanoI. 0.1% SDS). After transferring to 
Gelman Biotrace NT membrane (in 25 mM Tris. 192 mM glycine. 10 mM (i-mercaptoethanol. 
0.05% SDS. 20% methanol), the membrane was blocked with 5% non-fat milk and 1% goat- 
serum in Tris-buffered saline-Tween (TBS-T) for 12 hrs at 4°C. Immunosta.ning was done 
according to the protocol for the ECL western blotting detection reagents (Amersham Life 
Sciences. Buckingham. UK). The primary antibodies used were Dys-2 (Vector Laboratories) 
and anti-p-galactosidase (BMB. Indianapolis. IN) with a horseradish peroxidase-conjugated 
anti-mouse secondary antibody. 

Western blot analysis of EAM-infected mcLx myoblasts and myotubes (three days post- 
fusion) indicated that EAMs were able to infect both of these cell types (Figure 15). In 
Figure 15. immunoblots of protein extracts from mdx myoblasts and myotubes demonstrating 
the expression of p-galactosidase (A) and dystrophin (B) in cells infected with Ad5pdys 
EAMs. Total protein was extracted 3 days post infection in ail cases. Myotubes were 
mfected at three days following a switch to differentiation media. In Figure 15A. lane I 
contains total protein extract from 293 cells infected with a virus expressing P-galactosidase 
as a control (RSV-LacZ): lanes 2-5 contain total protein extracts from uninfected mdx 
myoblasts, mdx myoblasts infected with Ad5pdys EAMs, mdx myotubes and mdx myotubes 
derived from mdx myoblasts infected with Ad5pdys EAMs. respectively. In Figure I SB. 
lanes 1 and 7 contain total protein from mouse muscle V'CST) while lane 2 contains protein 
from wild type MM 14 myotubes. as controls. Lanes 3-5 contain total protein extracts from 
uninfected mdx myoblasts, mdx myoblasts infected with Ad5pdys EAMs. mdx myotubes and 
mdx myotubes derived from mdx myoblasts infected with AdSpdys EAMs. respectively. 
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As shown in FTgure 15. expression of p-galactosidase was detected in both the infected 
nulx m\oblasts and myotubes indicating that the CMV promoter was active at both these earlv 
stages of differentiation in muscle cells. However, only infected mdx myotubes produced 
protein detected by dys-2. an antibody recognizing the 17 C-lerminal amino acids of 
dystrophin (Fig. 15). No dystrophin expression was detected in infected myoblasts by western 
analysis, indicating that the muscle creatine kinase promoter functions minimally, if at all, 
within the AdSfJdys EAM prior to terminal differentiation of these cells. 

Dystrophin expression in mdx cells infected with EAMs was confirmed by 
immunofluorescence studies using N-terminal dystrophin antibodies. In agreement with the 
western analysis, dystrophin expression from the MCK promoter was detected only in 
differentiated mdx myotubes infected by AdSpdys EAMs (Fig. 16). Figures I6A-C show 
immunotluorescence of dystrophin in wild type MM14 myotubes (A), uninfected mdx (B) and 
mfected mdx myotubes (C), respectively. The results shown in Figure 16 demonstrate the 
transfer and expression of recombinant dystrophin to differentiated mdx cells by AdSpdys 
15 EAMs. 

Immunofluorescence of myogenic cells was performed as follows. Approximately 1.5 
X lO'^MMH or mdx myoblasts were plated on Poly-L-lysine (Sigma) coated glass slides (7 x 
3 cm) which had been previously etched with a 0.05% chromium potassium sulfate and 0.1% 
gelatin solution. For myotube analysis, the cultures were switched to differentiation media 

20 [Clegg tv <://. (1987). supra] 48 hours after plating, immediately infected and then allowed lo 
fuse for 3 days, whereas myoblasts were continuously propagated in proliferation media 
[Clegg ei id, (1987), supra]. Cells were washed three times with PBS at room temperature 
and fixed in 3.7% formaldehyde. For immunostaining. cells were incubated in 0.5% Triton 
X-IOO. blocked with 1% normal goat serum and incubated with an affinity purified antibody 

25 against the N-terniinus of murine dystrophin for 2 hrs. followed by extensive washing in PBS 
and 0.1% Tween-20 with gentle shaking. Cells were incubated with a 1 :200 dilution of biolin 
conjugated anti-rabbit antibody (Pierce) for one hour and washed as above. Cells were 
lurther incubated with a 1 :300 dilution of streptavidin-fiuorescein isolhiocyanate conjugate 
(Veciorlabs. Burlingam, CA) for one hour and washed as above, followed by extensive 

30 washing in PBS. 

The above results show that embedded inverted Ad origins of replication coupled to an 
encapsidation signal can convert circular DNA molecules to linear forms in the presence of 
helper virus and that these genomes can be efficiently encapsidated and propagated to high 
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(iters. Such viruses cWe purified on a CsCI gradient and mJRmin their ability to transduce 
cells in vino, and their increased cloning capacity allows the inclusion of large genes and 
tissue specific gene regulatory elements. The above results also show that the dystrophin eene 
was expressed in cells transduced by such viruses and that the protein product was correctly 
localized to the cell membrane. The above method for preparing EAMs theoretically enables 
virtually any gene of interest to be inserted into an infectious minichromosome by 
conventional cloning in plasmid vectors, followed by cotrans feet ion with helper xiral DNA in 
293 cells. This approach is useful for a variety of gene transfer studies in vitro. The 
observation that vectors completely lacking viral genes can be used to transfer a full-length 
dystrophin cDNA into myogenic cells indicates that this method may be used for the 
treatment of DMD using gene therapy. 

d) A Modified MCK Enhancer Increases Expression Of Linked 
Genes In iMuscle 

The DNA fragment containing enhancer/promoter of the MCK gene utilized in the 
AdSpdys EAM plasmid is quite large (-3.3. kb). In order to provide a smaller DNA fragment 
capable of directing high levels of expression of linked genes in muscle cells, portions of the 
3.3 kb MCK enhancer/promoter were deleted and/or modified and inserted in front of a 
reporter gene {lacZ). The enhancer element of the MCK gene was modified to produce the 
2RS5 enhancer: the sequence of the 2RS5 enhancer is provided in SEQ ID NO: 10. The first 
6 residues of SEQ ID NO: 10 represent a Kpn\ site added for ease of manipulation of the 
modified MCK enhancer element. Residue number 7 of SEQ ID NO: 10 corresponds to 
residue number 2164 of the wild-type MCK enhancer sequence listed in SEQ ID N0:9 
(position 2164 of SEQ ID NO:9 corresponds to position -1256 of the MCK gene). Residue 
number 174 of SEQ ID NO: 10 corresponds to residue number 2266 of the wild-type MCK 
enhancer sequence listed in SEQ ID NO:9. 

These MCK/lacZ constructs were used to transfect cells in culture {i.e.. myogenic 
cultures) or were injected as naked DNA into the muscle of mice and P-galactosidase activity 
was measured. Figure 1 7 provides a schematic of the MCK/lacZ constructs tested. The first 
construct shown in Figure 17 contains the 3.3. kb wild-type MCK enhancer/promoter 
fragment linked to the E. coli lacZ gene. The wild-type enhancer element (-1256 to -1056) 
is depicted by the box containing "E": the core promoter element (-358 to -80) is indicated by 
the light cross-hatching and the minimal promoter element (-80 to +7) is indicated by the dark 
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cross-hatching. The\>rr(^ promoter element is required in adovcton to the minimal promoter 
element (which is required for basal expression) in order to achieve increased muscle-specific 
expression in tissue culture [Shield a uL (1996) Mol. Cell. Biol. 16:5058]. The modified 
enhancer element, the 2RS5 enhancer, is indicated by the box labeled "E*." The box labeled 
"mi nx" contains a synthetic intron derived from adenovirus and is used to increase expression 
of the constructs (any intron may be utilized for this purpose). The box labeled "CMV" 
depicts the CMV IE enhancer/promoter which was used a positive control. 

In figure 17. p-calactosidase activity is expressed relative to the activity of the wild- 
type MCK enhancer/promoter construct shown at the top of the figure. As shown in Figure 
17. a construct containing the 2RS5 enhancer (SEQ ID NO: 10) linked to either the minimal 
MCK promoter (a - 261 bp element) or the core and minimal MCK promoter elements (a -539 
bp element) directs higher levels of expression of the reporter gene in muscle cells as 
compared to the -3.3 kb fragment containing the wild-type enhancer element. These modified 
enhancer/promoter elements are considerably smaller than the -3.3 kb fragment used in the 
AdSpdys EAM plasmid and are useful for directing the expression of foreign genes in muscle 
cells. These smaller elements are particularly useful for driving the expression of genes in the 
context of self-propagating adenoviral vectors which have more severe constraints on the 
amount of foreign DNA which can be inserted in comparison to the used of "gutted" 
adenoviruses such as the EAMs described above. 



EXAMPLE 7 

Generation Of High Titer Stocks Of Encapsidated Adenovirus 
Minichromosomes Containing Minimal Helper Virus Contamination 

The results presented in Example 6 demonstrated that encapsidated adenovirus 
minichromosomes (EAMs) can be prepared that lack all viral genes and which can express 
full-length dystrophin cDNAs in a muscle specific manner. The propagation of these EAMs 
requires the presence of helper adenoviruses that contaminate the final EAM preparation with 
conventional adenoviruses (about 4% of the total preparation). In this example the EAM 
system is modified to enable the generation of high titer stocks of EAMs with minimal helper 
virus contamination. Preferably the EAM stocks contain helper virus representing less than 
1%, preferably less than 0.1% and most preferably less than 0.01% of the final viral isolate. 
Purified EAMs are then injected in vivo in muscles of dystrophin minus mdx mice to 
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determine whether thWvectors lead to immune rejection and^ether they can alleviate 
dystrophic symptoms in mdx muscles. 

The amount of helper virus present in the EAM preparations is reduced in two ways. 
The first is by selectively controlling the relative packatiiniz efficiency of the helper virus 
versus the EAM virus. The second is to improve physical methods for separatmg EAM from 
helper x irus. These approaches enable the generation of dystrophin-expressing EAMs that are 
contammaied with minimal levels of helper virus. 

a) Development And Characterization Of Adenovirus Packaging 
Cell Lines Expressing The Cre Recombinase From 
Bacteriophage PI 

Cell lines expressing a range of Cre levels are used to optimize the amount of helper 
virus packaging that occurs during growth of the EAM vectors. The Cre-loxP system [Sauer 
and Henderson (1988) Proc. Natl. Acad. Sci. USA 85:5166] is employed to selectively disable 
helper x irus packaging during growth of EAMs. The bacterial Cre recombinase catalyzes 
efficient recombination between a 34 bp target sequence called loxP. To delete a desired 
sequence. loxP sites are placed at each end of the sequence to be deleted in the same 
orientation; in the presence of Cre recombinase the intervening DNA segment is efficiently 
excised. 

Cell lines expressing a range of Cre recombinase levels are generated. The expression 
of too little Cre protein may result in high levels of helper virus being generated, which leads 
to unacceptably high levels of helper virus contaminating the final EAM preparation. If very 
high levels of Cre expression are present in a cell line, excision of the packaging signal from 
the helper virus would be 100% efficient {i.e., it would completely prevent helper virus 
packaging). As shown in Example 6. serial passage of EAM preparations containing low 
levels of helper virus increased the titer of the EAM. Therefore, it is desirable, at least in the 
initial passages of a serial passage that some helper virus capable of being packaged is 
present. A low level of packagable helper virus may be provided by using a cell line 
expressing levels of Cre recombinase which are not high enough to achieve excision of the 
packaging signals from 100% of the helper virus: these cell lines would be used early in the 
serial passaging of the EAM stock and a cell line expressing high enough levels of Cre 
recombinanse to completely prevent helper virus packaging would be used for the final 
passage. 
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Aliernaiively. EvxMs may be prepared using a packaging cell line thai supports hieh 
efficienc\- Cre recombinase-mediaied excision of the packaging signals from the helper virus 
by transfection of the Cre-expressing cell line with the EAM plasmid followed bv infection of 
these cells with /ox/^-conianing helper virus using an MOI of > 1.0. 
5 Human 293 cell lines thai express a variety of levels of Crc recombinase are generated 

as follows. An expression vector containing the Cre coding region. pOG23L was 
cotransfected into 293 cells along with pcDNA3 (Invitrogen): pcDNA3 contains the neo gene 
under the control of the SV40 promoter. pOG231 uses the human CMV enhancer/promoter 
[derived from CDM8 (Invitrogen)] to express a modified Crc gene [pOG231 was obtained 

10 from S. O'Gorman]. The modified Cre gene contained within pOG2jl has had a nuclear 

localization signal inserted into the coding region to increase the efficiency of recombination. 

pOG231 was constructed as follows. A Bglll site was introduced into the 5* Xbal site 
of the synthethic intron of pMLSISCAT [Haung and Gorman (1990) Nucleic Acids Res. 
18:937] by linker tailing. A BglU site in the synthethic intron of pMLSISCAT was destroyed 

15 and a BamHl linker was inserted into the Pstl site at the 3' end of the synthethic intron in 

pMLSISCAT. Bglll and Smal sites and a nuclear localization signal were introduced into the 
5' end of pMC-Cre [Gu et ai (1993) Cell 73:1 155] using a PCR fragment that extended from 
the novel BglW and Sma\ sites to the Bam\\\ site in the Cre coding region. This PCR 
fragment was ligaied to a Bam\\\ISal\ fragment containing a portion of the Cre coding region 

20 derived trom pIC-Cre (Gu e( cil., supra) and the intron plus Cre coding sequence was inserted 
into a modified form of pOG44 [O'Gorman et ai (1991) Science 251:1351] to generate 
pOG231. The predicted sequence of pOG23 1 from the BglW site to the BamHX site located in 
the middle of the O e coding sequence is listed in SEQ ID NO:l 1. 

One 60 mm dish of 293 cells (Microbix) were iransfected with 10 |ag of PvuW- 

25 linearized pOG23I and 1 jag of /Vod-linearized pcDNA3 using a standard calcium phosphate 
precipitation protocol. Two days after the addition of DNA. the transfected cells were split 
into three 100 mm dishes and 1000 \xglm\ of active G418 was added to the medium. The 
cells were fed periodically with G41 8-contaning medium and three weeks later. 24 G418- 
resistant clones were isolated. 

^0 The isolated clones were expanded for testing. Aliquots were frozen in liquid nitrogen 

at the earliest possible passage. The neomycin resistant cell lines were examined for the 
expression of Cre recombinase using following transfection assay. The neomycin resistant 
cells were transfected with PGK-l-GFP-lacZ (obtained from Sally Camper. Univ. of 



- 58 - 



-wo 98/17783- /g^ PCTAJS97/ 19541 

Michiijan. Ann ArboV^I). which contains a green nuorescenfprotein (GFP) expression 
cassette that can be excised by Oc recombinase to allow expression of p-gaiactosidase: 
iransfeclion was accomplised using standard calcium phosphate precipitation. Figure 18 
provides a schematic of a GFP/p-ga! reporter construct suitable for assaymg the expression ot 
Crc recombinase in mammalian cells: GFP sequences and P-gal (i.e.. iacZ) sequences are 
avaialbale from commmerical sources (e.^.. Clonetech. Palo Alto. CA and Pharmacia Biotech, 
Piscataway. NJ, respectively). 

Control experiments verified that 293 cells transfected with PGK- 1 -GFP-lacZ 
expressed significant amounts of P-galactosidase only if these cells also expressed Oe 
recombinanse. p-galactosidase assays were performed as described in Example 6. Neomycin 
resistam ceils expressing Cre recombinase were grouped as high, medium or low expressors 
based upon the amount of P-galactosidase activity produced (estimated by direct counting of 
p-galactosidase-positive cells per high-power field and by observing the level of staining and 
the rapidity with which the blue stain was apparent) when these cell lines were transfected 
with the GFP/p-gal reporter construct. Thirteen positive {i.e.. CVe-expressing) lines. D608#12 
Ul. #22. ^^18. #17. #4, #8. #2, #2/2, #13. #5. #15 and #21 were retained for further use. 

The results of this transfection analysis revealed that cultures of 293 cells expressing 
medium to high levels of Cre recombinase could be generated without apparent toxicity. 

b) Generation Of Helper Adenovirus Strains That Contain loxP 
Sites Flanking The Adenovirus Packaging Signals 

Studies of EAM production demonstrated that the EAM vector has a packaging 
advantage over the helper adenovirus (Ex. 6). While not limiting the present invention to a 
particular mechanism, it is hypothesized that this packaging and replication advantage can be 
greatly increased by using helper viruses that approach the packaging size limits of Ad5 [Bett 
ei al. (1993) J. Virol. 67:591 1], by using viruses with mutations in E4 and/or E2 genes [Yeh 
et cil. (1996) J. Virol. 70:559: Gorziglia et al. (1996) J. Virol. 70:4173: and Amalfitano et al. 
(1996) Proc. Natl. Acad. Sci. USA 93:3352]. by inclusion of mutations or alterations in the 
packaging signals of the helper virus [Imler et al. (1995) Hum. Gene Ther. 6:711] and by 
combining these strategies. 

The Cre-loxP excision method is used to disable the packaging signals from the helper 
virus genomes. The Ad5 packaging domain extends from nucleotide 194 to 358 and is 
composed of five distinct elements that are functionally redundant [Hearing et al. (1987) J. 
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Virol. 61:2555]. Theomically. any molecule containing the ASd origin of replication and 
packaging eiemenis should replicate and be packaged into mature virions in the presence of 
non-defective helper virus. Disabling the packaging signals should allow replication and eene 
expression to proceed, but will prevent packaging of viral DNA into infectious particles. This 
in turn should allow the ratio of EAM to helper virus to be increased greatlv. 

To disable the packaging signals within the helper virus used to encapsidate the EAMs. 
the loxP sequences are incorporated into a helper virus that has a genome approaching the 
maximal packaging size for Ad [-105 map units: Bett at uL (1993). supra] which further 
decreases the efficiency of helper virus packaging. Final virus size can be adjusted by the 
choice of introns inserted into the reponer gene, by the choice of reporter genes, or by 
including a variety of DNA fragments of various sizes to act as "stuffer" fraamenis. A 
convenient reporter gene is the alkaline phosphatase gene (see Ex, 6). The optimized Cre- 
loxP system is also incorporated into a helper viral backbone containing disruptions of anv or 
all of the E1-E4 genes. Use of such deleted genomes requires viral growth on appropriate 
complementing cell lines, such as 293 cells expressing E2, and/or E4 gene products. The 
loxP sequences are incorporated into the helper virus by placing the loxP 
sequences on either side of the packaging signals on a shuttle vector. This modified shuttle 
vector is then used to recombine with the Ad DNA derived from a virus containing 
disruptions of any or all of the E1-E4 genes to produce the desired helper virus containing the 
packaging signals flanked by loxP sequences. 

pAdBglll. an adenovirus shuttle plasmid containing Ad sequences from 0-1 and 9-16 
map units (mu) was used as the starting material. Synthetic oligonucleotides were used to 
create a polylinker which was inserted at I mu within pAdBglll as follows. The Bglll LoxP 
oligo [5*-GAAGATCTATAACTTCGTATAATGTATGCTA 

TACGA-\GTTATTACCGAAGAAATGGCTCGAGATCTTC-3' (SEQ ID NO: 12) and its 
reverese complement [5'-GAAGATCTCGAGCCATTTCTTCGGTAATAACTTCGT 
ATAGCATACATTATACGAAGTTATAGATCTTC-3^ (SEQ ID NO: 13)] were synthesized. 
The AniII LoxP oligo [5'-CCACATGTATAACTTCGTATAGCATACA 
TTATACGAAGTTATACATGTGG-3* (SEQ ID NO: 14)] and its reverse complement [5"- 
CCACATGTATAACTTCGTATAATGTATGCTATACGAAGTTATACATG TGG-3' (SEQ 
ID NO: 1 5)] were synthesized. The double stranded form of each loxP oiigonuceotide was 
digested with the appropriate restriction enzyme {e.g.. Bglll or Ajllll) and inserted into 
pAdBglll which had been digested with Bglll and .1/7111. This resulted in the insertion of 
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LoxP sequences m.o the shuttle vector tlank.ng the packaging signals that are located between 
0.5 and 1 mu of Ad5 (one mu equals 360 bp in the Ad5 genome; the sequence of the Ad5 
genome >s Isi.ed in SEQ ID NO:!). The 3" loxP sequence was mserted mto the Bg[\\ s.te 
within pAdBglll. The 5' loxP sequence was inserted into the A/nU site located at base 143 
(-0.8 mu) (numbering relative to Ad5). 

DNA sequencing was used to verify the final structure of the modified shuttle plasmid 
between 0-1 mu. if Cre recombinase-mediated exc.sion of the packaging signals .s found to 
be too efficient (as judged by the production of too little helper virus), alternate sues for the 
insertion of the loxP sequences are used that would result in deletion of 2. 3 or 4 packaging 
. sequences rather than all 5 [Grable and Hearing (1990) J. Virol. 64:2047]. The insertion of 
loxP sequences at sites along the Ad genome contained within the shuttle vector which would 
result in the deletion 2. 3 or 4 packaging sequences are easily made using the technique of 
recombinant PCR [Htguchi (1990) In: PCR Protocols: A Guide to Methods and 
Applications. Innis et al. (eds.) Academic Press. San Diego. CA. pp. 177-183], The optimal 
amount of CVc recombinase-mediated e.xcision of the packaging signals is that amount which 
permits the production of enough packaged helper virus to permit the slow spread of virus on 
.he first plate of cells co-transfected with helper virus DNA (containing the loxP sequences) 
and the EAM vector. This permits serial passage of the EAM preparation onto a subsequent 
lawn of ceils to increase the titer of the EAM preparation. Alternatively, if Cre recombinase- 
mediated e.xcision of the packaging signals is essentially 100% efficiem in all Cre-expressing 
cells lines (i.e. regardless of the level of C/-e-expression. that is even a cell line expressing a 
low level of Oc as judged by the GFP/p.gal assay described above), the packaged EAMs 
may be used along with helper virus (used at a MOI of ~1.0) to infect the second or 
subsequent lawn of cells to permit serial passaging to increase the titer of the EAM 
preparation. 

Following introduction of the loxP sequences into the shuttle vector, the human 
placental alkaline phosphatase (HpAp) cDNA under control of the RSV promoter was inserted 
into the polylinker to provide a reporter gene for the helper virus. This is the same reporter 
used previously during EAM generation (Ex. 6). The HpAp sequences were inserted as 
follows. The W-containing shuttle vector was linearized with A7;oi and the hpAp cassene 
was ligated into the Xho\ site (a Xho\ site was inserted into pAdBglll during the insertion of 
the loxP sequences as a XJ,o\ site was located on the 3" end of the loxP sequences inserted 
into the fie/II site of pAdBglll). The HpAp cassette was constructed as follows. 
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pRSVhAPT40 (obiainOT^from Gary Nabel. Univ. of Michiean^ Ann Arbor. MI ) was dieested 
with EcoRl to generate an EcoRl fragment containing the HpAP cDNA and the SV40 intron 
and polyadenylation sequences. pRc/RSV (inviirogen) was digested with HindlU, then 
partially digested with £c-oRJ. A 5,208 bp fragment was then size selected on an agarose gel. 
5 treated with calf alkaline phosphatase, and ligated to the EcoRl fragment derived from 

pRSVhAPT40 to generate pRc/RSVAP. pRc/RSVAP was then digested with Safl and A77c?l to 
liberate the RSV promoter linked to the HpAP cDNA cassette (including the SV40 intron and 
polyadenylation sequences). This Sal\-Xhol fragment was inserted into the /av/^-containing 
shuttle vector which had been digested with Sal] and Xho\ lo generate pADLoxP-RSVAP. 

10 Helper virus containing loxP sites flanking the Ad packaging signals is generated by 

co-transfection of the LoxP shuttle plasmid (pADLoxP-RSVAP) and C/t/I-digesied 
Ad5^//7001 DNA into 293 cells [Graham and Prevec (1991) Manipulation of Adenovirus 
Vectors, In Methods in Molecular Biolog}\ Vol. Gene Transfer and Expression Protocols, 
Murray (ed.). Humana Press Inc.. Clifton. NJ. pp. 109-128]. Figure 19 provides a schematic 

15 showing the recombination event between the loxP shuttle vector and the Ad5 J/7001 genome. 
Co-transfection is carried out as described in Example 3c. 

.Alternatively, the reporter gene may be inserted into the E3 region of pBHGlO or 
pBHGl 1 (using the unique Pad site) rather than into the polylinker located in the El region 
of the shuttle vector. The reporter gene-containing pBHGlO or 1 1 is then used in place of 

20 Ad56//7001 for cotransfection of 293 cells along with the loxP-containing pAdBglll. 

Following cotransfection, recombinant plaques are picked, plaque purified, and tested 
for incorporation of both hpAp and loxP sequences by PCR and Southern analysis (Ex. 3). 
Viruses which contain loxP sites flanking the packaging signals and the marker gene (hpAp) 
are retained, propagated and purified. 

25 The isolated helper virus containing loxP sites flanking the packaging signals and the 

marker gene is then used to infect both 293 cells and the 293 cell lines that express Cre 
(section a, above). Cre recombinase-expressing cell lines that produce optimal levels of Cre 
recombinase when infected with the /ar/*-containing helper virus are then used for the 
generation of EAMs as described below. 

30 
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c) Generafion Of Encapsidated Adenovirus Minl^romosomes 
That Express Dystrophin 

The growth of EAMs is optimized using tiie follovvinc methods. In the Hrst method, 
plasmid DNA from the EAM vector (Ex. 6) are co-transfec.ed mto 293 cells w.th purified 
viral DNA from the helper virus, and viruses are harvested 10-14 days later after appearance 
of a viral cytopathic effect (CPE) as described in Example 6. This approach ,s simple, vei 
potentially mcreases helper virus levels by allowing viral spread throughout the culture dishes. 

In the second method, the co-transfected 293 cells are overlaid with agar and single 
plaques are picked and tested for EAM activity by the ability to e.xpress P-galactosidase 
activity following infection of Hela cells (Ex. 6). This second approach is more time 
consuming, but will result in less contamination by helper. 

The ability of these two methods to generate EAMs are directly compared. The 
preferred method is thai which produces the highest titer of EAM with the lowest 
contamination of helper virus. The etTiciency of EAM generation may also be increased by 
using helper virus DNA that retains the terminal protein (TP) {i.e.. the helper virus is used as 
a viral DNA-TPC as described in Ex. 3) The use of viral DNA-TPC has been shown to 
increase the efficiency of viral production following transfection of 293 cells by an order of 
magnitude [Miyake el al. (1996). .siipru]. 

The initial transfection will utilize 293 cells that do not express Cre recombinase. so 
that efficient spread of the helper can lead to large scale production of EAM. The method 
producing the highest ratio of EAM to helper virus is then employed to optimize conditions 
for the serial propagation of the EAM on 293 cells expressing Cre recombinase. This 
optimization is conducted using ceil lines expressing different levels of Cre recombinase. The 
following variables are tested: 1) the ratio of input viral titer to cell number. 2) the number 
^.pf serial passages to use for EAM generation, 3) the use of cell lines producing different 
levels of Cre to achieve the optimal ratio between high EAM tiler and low helper titer. 4) 
continuous growth on Oe-producing 293 cells versus alternating between Cre-producing cells 
and the parental 293 cells, or to alternate between high and low C/e-producing cells and 5) 
CsCl purification of EAMs prior to re-infection of 293 cells increases the ratio of the final 
EAM/helper titers. The protocols that result in the highest yield of EAM with minimal helper 
virus are used to generate large volumes of crude viral lysates tor purification by density 
gradient centrifugaiion (Ex. 6). 
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EAMs were purified from helper virus using standard CsCI density gradient 
centnruuation protocols in Example 6 and resulted in preparations conlaininu -4% helper virus. 
In order to improve the physical separation of EAMs and helper virus, a variety of different 
centrifucation conditions are possible, including changing the gradient shape, type of rotor and 
5 tube used, combinations of step and continuous gradients, and the number of gradients used. 
Materials with better resolving powers than CsCl can also be employed. These include 
rubidium chloride and potassium bromide [Reich and Zarybnicky (1979) Annal. Biochem. 
94:1931. 

10 d) Use Of EAMs Encoding Dystrophin For Long Term 

Expression Of Dystrophin in Muscle 

To demonstrate the ability of the purified dystrophin expressing EAMs (prepared as 
described above) to deliver and express dystrophin in the muscle of an animal, the following 
experiment is conducted. First, purified dystrophin-EAMs are delivered to the muscle by 

15 direct intramuscular injection into newborn. 1 month, and adult (3 month) mouse quadriceps 
muscle. The dystrophin expressing EAM Ad5(3Dys are injected into mcLx mice and into 
transgenic mdx mice that express P-galactosidase in the pituitary gland [Tripathy e( al. (1996) 
Nature Med. 2:545]. These latter mice are used to avoid potential immune-rejection of cells 
expressing p-galactosidase from the Ad5pDys vector. An alternate EAM lacking the P- 

20 galactosidase reporter gene may also be employed: however, the presence of the P-Gal 

reporter simplifies EAM growth and purification (vectors lacking P-Gal have their purity 
estimated by PGR assays rather than by p-Gal assays). 

Following intramuscular injection of EAM. animals are sacrificed at intervals between 
1 week and 6 months to measure dystrophin expression [by western biot analysis and by 

25 immunofluorescence (Phelps e( uL (1995) Hum. MoL Genet. 4:1251 and Rafael e( ai (1996) 
J. Cell Biol. 134:93] and muscle extracts will also be assayed for P-Gal activity [MacGregor 
et ai (1991). supra]. These results are compared with previous results obtained using current 
generation viral vectors {i.e.. containing deletions in El and E3 only) to demonstrate that 
EAMs improve the prospects for long term gene expression in muscle. 

30 
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e) Use Of Dystrophin-EAMs To Prevent, Halt. Or Reverse The 
Dystrophic Symptoms That Develop In The Muscles Of mdx 
Mice 

To demonstrate ihat beneficial effects on dystrophic muscle are achieved by delivery 
of dystrophin expressing EAMs to mdx mice, the following experiments are conducted. 
Central nuclei counts are performed on soieus muscle at intervals followmg injection of EAM 
into newborn. 1 month, and 3 month (adult) mice (Pheips ef ai. 1995), Central nuclei arise 
m mouse muscle only after a myofiber has undergone dystrophic necrosis followed by 
regeneration, and is a quantitative measure of the degree of dystrophy that has occurred in a 
muscle group (Phelps ei ai, 1995). 

More informative assays are also contemplated. These assays require administration of 
E.AMs to mouse diaphragm muscles. 

The diaphragm is severely affected in mdx mice (Stedman ei al.. 1991; Cox et ai, 
1993). and displays dramatic decreases in both force and power generation. Administration of 
\ irus to the diaplvagm will allow the strength of the muscles to be measured at intervals 
following dystrophin delivery. The force and power generating assays developed by the 
Faulkner lab are used to measure the effect of dystrophin transgenes (Shrager ei ai. 1992: 
Rafael ci ai. 1994: Lynch at al.. 1996). 

To detemine whether dystrophin delivery to dystrophic muscle reverses dystrophy or 
stabilizes muscle, varying amounts of the dystrophin EAM are delivered to the to diaphragm 
at different stages of the dystrophic process and then strength is measured at intervals 
following EAM administration. First, various titers of EAM are tested to determine the 
minimal amount of virus needed to transduce the majority of muscle fibers in the diaphragm. 
It has been shown that conventional adenovirus vectors can transduce the majority of 
diaphragm fibers when 10^ pfu are administrated by direct injection into the intraperitoneal 
cavity [Huard ei al. (1995) Gene Therapy 2:107]. In addition, it has been shown that 
transduction of a simple majority of fibers in a muscle group is sufficient to prevent virtually 
all the dystrophic symptoms in mice [Rafael et al. (1994) Hum. Mol. Genet. 3:1725 and 
Phelps er al. (1995) Hum. Mol. Genet. 4:1251]. Virus is administered to mdx animals at three 
different ages (neonatal. 1 month, and 3 months). Animals are sacrificed for physiological 
analysis of diaphragm muscle at two different times post infection ( I month and 3 months). 
Error control is achieved by performing these experiments in sextuplicate. Comrol animals 
consist of mock injected wild-type (C57BI/10) and dystrophic (mdx) mice. Tliree month old 
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fndx nuce display a 40% reduction in force and power generation compared with wild-type 
mice, while 6 month animals display a greater than 50% reduction (Cox ci al. (1993) Nature 
364:725; Rafael et cti (1^94), supra; Phelps ct al. (1995). supra: and Corrado a al. (1996) J. 
Cell. Biol. 134:873]. 



EXAMPLE 8 

Improved Shuttle Vectors For The 
Production Of Helper Virus Containing LoxP Sites 



10 In the previous example, a shuttle vector containing adenoviral sequences extending 

from 9 mu to 16 mu of the Ad5 genome was modified to contain loxP sequences surrounding 
the packaging signals (the LoxP shuttle vector). This modified shuttle vector was then 
recombined with an Ad virus to produce a helper virus containing the loxP sequences. This 
helper virus was then used to infect cells expressing Cre recombinase along with DNA 

1 5 comprising a minichromosome containing the dystrophin gene, a reporter gene and the 

packaging signals and ITRs of Ad in order to preferentially package the minichromosomes. 
Using this approach the helper virus, which has had the packaging signals removed by Cre- 
loxP recombination, contains the majority of the Ad genome (only a portion of the E3 region 
is deleted). Thus, if low levels of helper virus are packaged and appear in the EAM 

20 preparation, the EAM preparation has the potential of passing on helper virus capable of 

directing the expression of Ad proteins in cells which are exposed to the EAM preparation. 
The expression of Ad proteins may lead to an immune response directed against the infected 
cells. 

Another approach to reducing the possibility that the EAM preparation contains helper 
25 virus capable of provoking an immune response is to use helper viruses containing deletions 
and/or mutations within the pol and pTP genes. Helper virus containing a deletion in the pol 
and/or pTP genes is coiransfected with the EAM construct into 293-derived cell lines 
expressing pol or pol and pTP to produce EAMs. Any helper virus present in the purified 
EAM preparation will be replication defective due to the deletion in the pol and/or pTP genes. 
30 As shown in Example 3, viruses containing a deletion in the pol gene are incapable of 

directing the expression of viral late genes: therefore, helper viruses containing a deletion in 
the pol gene or the pol and pTP genes should not be capable of provoking an immune 
response (i.e., a CTL response) against late viral proteins synthesized de novo. Shuttle 
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vectors contain deletion^ithin the Ad pel and/or pTP genes are constructed as described 
below. 

a) Construction Of A Shuttle Vector Containing The Apol 
Deletion 

pAdBglM was modified lo contain sequences corresponding to 9 to 40 mu of the Ad5 
genome as follows. pAdBglll was digested with Bg/U and a linker/adapter containing an Ascl 
site was added to create pAdBgUIAsc. pAdBglllAsc was then digested with Bsf\ 1071. 
Ad5<//7001 viral DNA was digested with Ascl and the ends were filled in using T4 DNA 
polymerase. The .4.vcl -digested. T4 DNA polymerase filled Ad5J/700l viral DNA was then 
digested with BslWOll and the -9.9 kb Asc\-Bs(\ 107l(fil]ed) fragment contaming the pol and 
pTP genes was isolated (as described in Ex. 3) and ligated to the Bs(\ 1071-digested 
pAdBglllAsc to generate pAdAsc. pAdAsc is a shuttle vector which contains the genes 
encoding the DNA polymerase and preterminal protein (the inserted A.scl-B.u\ 1071 fragment 
corresponds to nucleotides 5767 to 15. 671 of the Ad5 genome). 

A shuttle vector. pAdAscApol. which contains the 612 bp deletion in the pol gene 
(described in Ex. 3) was constructed as follows. Ad5ApolAE3l viral DNA (Ex. 3) was 
digested with Ascl and the ends were filled in using T4 DNA polymerase. The /l5cl -digested. 
T4 DNA polymerase filled Ad5ApolAE3I viral DNA was then digested with Bs(\ 1071 and the 
-9.3 kb Ascl-Bsil 107I(filIed) fragment containing the deleted pol gene and the pTP gene was 
isolated (as described in Ex. 3) and ligated to 5^/1 1071-digested pAdBglllAsc to generate 
pAdAscApol. 

b) Construction Of A Shuttle Vector Containing The ApoiApTP 
Deletion 

A shuttle vector. pAdAscApolApTP. which contains a 2.3 kb deletion within the pol 

25 and pTP genes (described in Ex. 5) was constructed as follows. 

pAXBApolApTPVARNA+tl3 (Ex. 5b) was digested with Ascl and the ends were filled in 
using T4 DNA polymerase. The .-iicl-digested. T4 DNA polymerase filled 
pAXBApolApTPVARNA+tl3 DNA was then digested with Bsil 1071 and the -7.6 kb Ascl- 
5.V/1 107I(filled) fragment containing the deleted pol gene and the pTP gene was isolated (as 

30 described in Ex. 3) and ligated to Bst\ I071.digested pAdBglllAsc to generate 
pAdAscApol ApTP. 

In order to reduce the packaging of the above helper viruses, the pol" or pol". pTP" 
helper viruses can be modified to incorporate loxP sequences on either side of the packaging 
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signals as outlined in Example 7. The /ox/^-containing poT or poP. pTP~ shunle vectors 
(pTP- shuttle vectors may also be employed) are cotranstected into 293 cells expressing Cre 
rccombinase and pol or Cre recombinase. pol and pTP. respectively along with an appropriate 
El" viral DNA-TPC (the El - viral DNA may also contain deletions elsewhere m the genome 
such as in the E4 genes or in the E2a gene as packaging cell lines expressing the E4 ORF 6 
or 6/7 and lines expressing E2a genes are avaialble) to generate helper virus containing loxP 
sites Hanking the packaging signals as well as a deletion in the pol gene or the pol and pTP 
genes. The resulting helper virus(es) is used to coiranstect 293 cells expressing pol or pol 
and pTP along with the desired EAM construct. The resultirLg EAM preparation should 
contain little if any helper virus and any contaminating helper virus present would be 
replication defective and incapable of expressing viral late gene products. Helper viruses 
containing loxP sequences and deletions in all essential early genes may be emploved in 
conjunction with Cre recombinanse-expressing cell lines expressing in trans the El. E4 ORF 
6. E2a. and E2b (e,g,. Ad polymerase and pTP) proteins (the E3 proteins are dispensible for 
groulh in culture). Cell lines coexpressing El, polymerase and pTP are provided herein. 
Cell lines expressing El and E4 proteins have been recently described [Krougliak and Graham 
(1995) Hum. Gene Ther. 6:1575 and Wang e( ai (1995) Gene Ther. 2:775] and cell lines 
expressing El and E2a proteins have been recently described [Zhou et al. (1996) J. Virol. 
70:7030]. Therefore, a cell line co-expressing EL E2a. E2b. and E4 is consructed by 
introduction of expression plasmids containing the E2a and E4 coding regions into the E1-. 
Ad polymerase- and pTP-expressing cell lines of the present invention. These packaging cell 
lines are used in conjunction with helper viruses containing deletions in the EL E2a. E2b and 
E4 regions. 

All publications and patents mentioned in the above specification are herein 
incorporated by reference. Various modifications and variations of the described method and 
system of the invention will be apparent to those skilled in the art without depaning from the 
scope and spirit of the invention. Although the invention has been described in connection 
with specific preferred embodiments, it should be understood that the invention as claimed 
should not be unduly limited to such specific embodiments. Indeed, various modifications of 
the described modes for carrying out the invention which are obvious to those skilled in 
molecular biology or related fields are intended to be within the scope of the following 
claims. 



- 68 - 



wo 98/17783 




PCT/US97/19541 



SEQUENCE LISTING 

(1) GENERAL INFORMATION: 



(1) APPLICANT: Chamberlain, Jeffrey S. 

Amalfitano, Andrea 

Hauser, Michael A. 

Kumar - S mgh . Ra j endra 

Hart igan-O ' Connor , Dennis J. 

(li) TITLE OF INVENTION: IMPROVED ADENOVIRUS VECTORS 

(iii) NUMBER OF SEQUENCES: 15 



(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Medlen & Carroll, LLP 

(B) STREET: 220 Montgomery Street, Suite 2200 

(C) CITY: San Francisco 

(D) STATE: California 

(E) COUNTRY: United States Of America 

(F) ZIP: 94104 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC - DOS /MS - DOS 

(D) SOFTWARE: Patentin Release ^^1.0, Version \ 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US 

(B) FILING DATE: 

(C) CLASSIFICATION : 

(viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: Ingolia, Diane E. 

(B) REGISTRATION NUMBER: 40,027 

(C) REFERENCE/DOCKET NUMBER- UM-02484 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (415) 705-8410 

(B) TELEFAX: (415) 397-8338 

(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35935 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS t double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "DNA" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 



CATCATCAAT 


AATATACCTT 


ATTTTGGATT 


GAAGCCAATA 


TGATAATGAG 


GGGGTGGAGT 


60 


TTGTGACGTG 


GCGCGGGGCG 


TGGGAACGGG 


GCGGGTGACG 


TAGTAGTGTG 


GCGGAAGTGT 


120 


GATGTTGCAA 


GTGTGGCGGA 


ACACATGTAA 


GCGACGGATG 


TGGCAAAAGT 


GACGTTTTTG 


180 


GTGTGCGCCG 


GTGTACACAG 


GAAGTGACAA 


TTTTCGCGCG 


GTTTTAGGCG 


GATGTTGTAG 


240 


TAAATTTGGG 


CGTAACCGAG 


TAAGATTTGG 


CCATTTTCGC 


GGGAAAACTG 


AATAAGAGGA 


300 


AGTGAAATCT 


GAATAATTTT 


GTGTTACTCA 


TAGCGCGTAA 


TATTTGTCTA 


GGGCCGCGGG 


360 


GACTTTGACC 


GTTTACGTGG 


AGACTCGCCC 


AGGTGTTTTT 


CTCAGGTGTT 


TTCCGCGTTC 


420 


CGGGTCAAAG 


TTGGCGTTTT 


ATTATTATAG 


TCAGCTGACG 


TGTAGTGTAT 


TTATACCCGG 


480 
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TGAGTTCCTC AAGAGGCCAC TCTTGAGTGC CAGCGAGTAG AGTTTTCTCC TCCGAGCCGC 54 0 

TCCGACACCG GGACTGAAAA TGAGACATAT TATCTGCCAC GGAGGTGTTA TTACCGAAGA 6 00 

AATGGCCGCC AGTCTTTTGG ACCAGCTGAT CGAAGAGGTA CTGGCTGATA ATCTTCCACC 6 60 

TCCTAGCCAT TTTGAACCAC CTACCCTTCA CGAACTGTAT GATTTAGACG TGACGGCCCC 720 

CGAAGATCCC AACGAGGAGG CGGTTTCGCA GATTTTTCCC GACTCTGTAA TGTTGGCGGT 78 0 

GCAGGAAGGG ATTGACTTAC TCACTTTTCC GCCGGCGCCC GGTTCTCCGG AGCCGCCTCA 84 0 

CCTTTCCCGG CAGCCCGAGC AGCCGGAGCA GAGAGCCTTG GGTCCGGTTT CTATGCCAAA 9 00 

CCTTGTACCG GAGGTGATCG ATCTTACCTG CCACGAGGCT GGCTTTCCAC CCAGTGACGA 96 0 

CGAGGATGAA GAGGGTGAGG AGTTTGTGTT AGATTATGTG GAGCACCCCG GGCACGGTTG 1020 

CAGGTCTTGT CATTATCACC GGAGGAATAC GGGGGACCCA GATATTATGT GTTCGCTTTG 10 80 

CTATATGAGG ACCTGTGGCA TGTTTGTCTA CAGTAAGTGA AAATTATGGG CAGTGGGTGA 114 0 

TAGAGTGGTG GGTTTGGTGT GGTAATTTTT TTTTTAATTT TTACAGTTTT GTGGTTTAAA 12 00 

GAATTTTGTA TTGTGATTTT TTTAAAAGGT CCTGTGTCTG AACCTGAGCC TGAGCCCGAG 1260 

CCAGAACCGG AGCCTGCAAG ACCTACCCGC CGTCCTAAAA TGGCGCCTGC TATCCTGAGA 13-2 0 

CGCCCGACAT CACCTGTGTC TAGAGAATGC AATAGTAGTA CGGATAGCTG TGACTCCGGT 13 80 

CCTTCTAACA CACCTCCTGA GATACACCCG GTGGTCCCGC TGTGCCCCAT TAAACCAGTT 144 0 

GCCGTGAGAG TTGGTGGGCG TCGCCAGGCT GTGGAATGTA TCGAGGACTT GCTTAACGAG 15 00 

CCTGGGCAAC CTTTGGACTT GAGCTGTAAA CGCCCCAGGC CATAAGGTGT AAACCTGTGA 156 0 

TTGCGTGTGT GGTTAACGCC TTTGTTTGCT GAATGAGTTG ATGTAAGTTT AATAAAGGGT 1620 

GAGATAATGT TTAACTTGCA TGGCGTGTTA AATGGGGCGG GGCTTAAAGG GTATATAATG 16 80 

CGCCGTGGGC TAATCTTGGT TACATCTGAC CTCATGGAGG CTTGGGAGTG TTTGGAAGAT 1740 

TTTTCTGCTG TGCGTAACTT GCTGGAACAG AGCTCTAACA GTACCTCTTG GTTTTGGAGG 18 00 

TTTCTGTGGG GCTCATCCCA GGCAAAGTTA GTCTGCAGAA TTAAGGAGGA TTACAAGTGG 186 0 

GAATTTGAAG AGCTTTTGAA ATCCTGTGGT GAGCTGTTTG ATTCTTTGAA TCTGGGTCAC 192 0 

CAGGCGCTTT TCCAAGAGAA GGTCATCAAG ACTTTGGATT TTTCCACACC GGGGCGCGCT 19 80 

GCGGCTGCTG TTGCTTTTTT GAGTTTTATA AAGGATAAAT GGAGCGAAGA AACCCATCTG 2 04 0 

AGCGGGGGGT ACCTGCTGGA TTTTCTGGCC ATGCATCTGT GGAGAGCGGT TGTGAGACAC 2100 

AAGAATCGCC TGCTACTGTT GTCTTCCGTC CGCCCGGCGA TAATACCGAC GGAGGAGCAG 2160 

CAGCAGCAGC AGGAGGAAGC CAGGCGGCGG CGGCAGGAGC AGAGCCCATG GAACCCGAGA 2220 

GCCGGCCTGG ACCCTCGGGA ATGAATGTTG TACAGGTGGC TGAACTGTAT CCAGAACTGA 22 8 0 

GACGCATTTT GACAATTACA GAGGATGGGC AGGGGCTAAA GGGGGTAAAG AGGGAGCGGG 2 3 40 

GGGCTTGTGA GGCTACAGAG GAGGCTAGGA ATCTAGC7TT TAGCTTAATG ACCAGACACC 24 00 

GTCCTGAGTG TATTACTTTT CAACAGATCA AGGATAATTG CGCTAATGAG CTTGATCTGC 24 6 0 

TGGCGCAGAA GTATTCCATA GAGCAGCTGA CCACTTACTG GCTGCAGCCA GGGGATGATT 25 20 

TTGAGGAGGC TATTAGGGTA TATGCAAAGG TGGCACTTAG GCCAGATTGC AAGTACAAGA 2 5 80 
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TCAGCA.^CT TGTAA^C AGGAATTGTT GCTACATTTC TGGGAACGGG GCCGAGGTGG 2640 
AGATAGATAC GGAGGATAGG GTGGCCTTTA GATGTAGCAT GATAAATATG TGGCCGGGGG 2700 
TGCTTGGCAT GGACGGGGTG GTTATTATGA ATGTAAGGTT TACTGGCCCC AATTTTAGCG 2 76 0 

GTACGGTTTT CCTGGCCAAT ACCAACCTTA TCCTACACGG TGTAAGCTTC TATGGGTTTA 2820 
ACAATACCTG TGTGGAAGCC TGGACCGATG TAAGGGTTCG GGGCTGTGCC TTTTACTGCT 2880 
GCTGGAAGGG GGTGGTGTGT CGCCCCAAAA GCAGGGCTTC AATTAAGAAA TGCCTCTTTG 2 94 0 
AAAGGTGTAC GTTGGGTATC CTGTCTGAGG GTAACTCCAG GGTGCGCCAC AATGTGGCCT ]ooo 
CCGACTGTGG TTGCTTCATG CTAGTGAAAA GCGTGGCTGT GATTAAGCAT AACATGGTAT 
GTGGCAACTG CGAGGACAGG GCCTCTCAGA TGCTGACCTG CTCGGACGGC AACTGTCACC 
TGCTGAAGAC CATTCACGTA GCCAGCCACT CTCGCAAGGC CTGGCCAGTG TTTGAGCATA 
ACATACTGAC CCGCTGTTCC TTGCATTTGG GTAACAGGAG GGGGGTGTTC CTACCTTACC 
AATGCAATTT GAGTGACACT AAGATATTGC TTGAGCCCGA GAGCATGTCC AAGGTGAACC 
TGAACGGGGT GTTTGACATG ACCATGAAGA TCTGGAAGGT GCTGAGGTAC GATGAGACCC 
GCACCAGGTG CAGACCCTGC GAGTGTGGCG GTAAACATAT TAGGAACCAG CCTGTGATGC 
TGGATGTGAC CGAGGAGCTG AGGCCCGATC ACTTGGTGCT GGCCTGCACC CGCGCTGAGT 
TTGGCTCTAG CGATGAAGAT ACAGATTGAG GTACTGAAAT GTGTGGGCGT GGCTTAAGGG 
TGGGAAAGAA TATATAAGGT GGGGGTCTTA TGTAGTTTTG TATCTGTTTT GCAGCAGCCG 
CCGCCGCCAT GAGCACCAAC TCGTTTGATG GAAGCATTGT GAGCTCATAT TTGACAACGC 
GCATGCCCCC ATGGGCCGGG GTGCGTCAGA ATGTGATGGG CTCCAGCATT GATGGTCGCC 3 72 0 

CCGTCCTGCC CGCAAACTCT ACTACCTTGA CCTACGAGAC CGTGTCTGGA ACGCCGTTGG 3780 
AGACTGCAGG CTCCGCCGCC GCTTCAGCCG CTGCAGCCAC CGCCCGCGGG ATTGTGACTG 
ACTTTGCTTT CCTGAGCCCG CTTGCAAGCA GTGCAGCTTC CCGTTCATCC GCCCGCGATG 
ACAAGTTGAC GGCTCTTTTG GCACAATTGG ATTCTTTGAC CCGGGAACTT AATGTCGTTT 
CfCAGCAGCT GTTGGATCTG CGCCAGCAGG TTTCTGCCCT GAAGGCTTCC TCCCCTCCCA 
ATGCGGTTTA AAACATAAAT AAAAAACCAG ACTCTGTTTG GATTTGGATC AAGCAAGTGT 
CTTGCTGTCT TTATTTAGGG GTTTTGCGCG CGCGGTAGGC GCGGGACCAG CGGTCTCGGT 
CGTTGAGGGT CCTGTGTATT TTTTCCAGGA CGTGGTAAAG GTGACTCTGG ATGTTCAGAT 4200 
ACATGGGCAT AAGCCCGTCT CTGGGGTGGA GGTAGCACCA CTGCAGAGCT TCATGCTGCG 4260 
GGGTGGTGTT GTAGATGATC CAGTCGTAGC AGGAGCGCTG GGCGTGGTGC CTAAAAATGT 4 320 
CTTTCAGTAG CAAGCTGATT GCCAGGGGCA GGCCCTTGGT GTAAGTGTTT ACAAAGCGGT 4380 
TAAGCTGGGA TGGGTGCATA CGTGGGGATA TGAGATGCAT CTTGGACTGT ATTTTTAGGT 444 0 
TGGCTATGTT CCCAGCCATA TCCCTCCGGG GATTCATGTT GTGCAGAACC ACCAGCACAG 4500 
TGTATCCGGT GCACTTGGGA AATTTGTCAT GTAGCTTAGA AGGAAATGCG TGGAAGAACT 4560 
TGGAGACGCC CTTGTGACCT CCAAGATTTT CCATGCATTC GTCCATAATG ATGGCAATGG 4620 
GCCCACGGGC GGCGGCCTGG GCGAAGATAT TTCTGGGATC ACTAACGTCA TAGTTGTGTT 4 6 80 



3 06 0 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 



3840 
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CCAGGATGAG ATCGTC^Rg GCCATTTTTA CAAAGCGCGG GCGGAGG^G CCAGACTGCG 4740 

GTATAATGGT TCCATCCGGC CCAGGGGCGT AGTTACCCTC ACAGATTTGC ATTTCCCACG 4 8 00 

CTTTGAGTTC AGATGGGGGG ATCATGTCTA CCTGCGGGGC GATGAAGAAA ACGGTTTCCG 4 86 0 

GGGTAGGGGA GATCAGCTGG GAAGAAAGCA GGTTCCTGAG CAGCTGCGAC TTACCGCAGC 4 92 0 

CGGTGGGCCC GTAAATCACA CCTATTACCG GGTGCAACTG GTAGTTAAGA GAGCTGCAGC 4 98 0 

TGCCGTCATC CCTGAGCAGG GGGGCCACTT CGTTAAGCAT GTCCCTGACT CGCATGTTTT 504 0 

CCCTGACCAA ATCCGCCAGA AGGCGCTCGC CGCCCAGCGA TAGCAGTTCT TGCAAGGAAG 510 0 

CAAAGTTTTT CAACGGTTTG AGACCGTCCG CCGTAGGCAT GCTTTTGAGC GTTTGACCAA 516 0 

GCAGTTCCAG GCGGTCCCAC AGCTCGGTCA CCTGCTCTAC GGCATCTCGA TCCAGCATAT 52 2 0 

CTCCTCGTTT CGCGGGTTGG GGCGGCTTTC GCTGTACGGC AGTAGTCGGT GCTCGTCCAG 528 0 

ACGGGCCAGG GTCATGTCTT TCCACGGGCG CAGGGTCCTC GTCAGCGTAG TCTGGGTCAC 5 34 0 

GGTGAAGGGG TGCGCTCCGG GCTGCGCGCT GGCCAGGGTG CGCTTGAGGC TGGTCCTGCT 54 00 

GGTGCTGAAG CGCTGCCGGT CTTCGCCCTG CGCGTCGGCC AGGTAGCATT TGACCATGGT 54 6 0 

GTCATAGTCC AGCCCCTCCG CGGCGTGGCC CTTGGCGCGC AGCTTGCCCT TGGAGGAGGC 552 0 

GCCGCACGAG GGGCAGTGCA GACTTTTGAG GGCGTAGAGC TTGGGCGCGA GAAATACCGA 5 58 0 

TTCCGGGGAG TAGGCATCCG CGCCGCAGGC CCCGCAGACG GTCTCGCATT CCACGAGCCA 564 0 

GGTGAGCTCT GGCCGTTCGG GGTCAAAAAC CAGGTTTCCC CCATGCTTTT TGATGCGTTT 5 70 0 

CTTACCTCTG GTTTCCATGA GCCGGTGTCC ACGCTCGGTG ACGAAAAGGC TGTCCGTGTC 5 76 0 

CCCGTATACA GACTTGAGAG GCCTGTCCTC GAGCGGTGTT CCGCGGTCCT CCTCGTATAG 58 2 0 

AAACTCGGAC CACTCTGAGA CAAAGGCTCG CGTCCAGGCC AGCACGAAGG AGGCTAAGTG 58 8 0 

GGAGGGGTAG CGGTCGTTGT CCACTAGGGG GTCCACTCGC TCCAGGGTGT GAAGACACAT 5 94 0 

GTCGCCCTCT TCGGCATCAA GGAAGGTGAT TGGTTTGTAG GTGTAGGCCA CGTGACCGGG 6 00 0 

TGTTCCTGAA GGGGGGCTAT AAAAGGGGGT GGGGGCGCGT TCGTCCTCAC TCTCTTCCGC 6 06 0 

ATCGCTGTCT GCGAGGGCCA GCTGTTGGGG TGAGTACTCC CTCTGAAAAG CGGGCATGAC 612 0 

TTCTGCGCTA AGATTGTCAG TTTCCAAAAA CGAGGAGGAT TTGATATTCA CCTGGCCCGC 618 0 

GGTGATGCCT TTGAGGGTGG CCGCATCCAT CTGGTCAGAA AAGACAATCT TTTTGTTGTC 624 0 

AAGCTTGGTG GCAAACGACC CGTAGAGGGC GTTGGACAGC AACTTGGCGA TGGAGCGCAG 6 3 00 

GGTTTGGTTT TTGTCGCGAT CGGCGCGCTC CTTGGCCGCG ATGTTTAGCT GCACGTATTC 6 36 0 

GCGCGCAACG CACCGCCATT CGGGAAAGAC GGTGGTGCGC TCGTCGGGCA CCAGGTGCAC 642 0 

GCGCCAACCG CGGTTGTGCA GGGTGACAAG GTCAACGCTG GTGGCTACCT CTCCGCGTAG 648 0 

GCGCTCGTTG GTCCAGCAGA GGCGGCCGCC CTTGCGCGAG CAGAATGGCG GTAGGGGGTC 6 54 0 

TAGCTGCGTC TCGTCCGGGG GGTCTGCGTC CACGGTAAAG ACCCCGGGCA GCAGGCGCGC 6 600 

GTCGAAGTAG TCTATCTTGC ATCCTTGCAA GTCTAGCGCC TGCTGCCATG CGCGGGCGGC 666 0 

AAGCGCGCGC TCGTATGGGT TGAGTGGGGG ACCCCATGGC ATGGGGTGGG TGAGCGCGGA 6 72 0 

GGCGTACATG CCGCAAATGT CGTAAACGTA GAGGGGCTCT CTGAGTATTC CAAGATATGT 6 78 0 
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AGGGTAGCAT CTTCCAcW^ GGATGCTGGC GCGCACGTAA TCGTATAGTT CGTGCGAGGG 5 84 0 

AGCGAGGAGG TCGGGACCGA GGTTGCTACG GGGGGGCTGC TCTGCTCGGA AGACTATCTG S900 
CCTGAAGATG GCATGTGAGT TGGATGATAT GGTTGGACGC TGGAAGACGT TGAAGCTGGC 
GTCTGTGAGA CCTACCGCGT CACGCACGAA GGAGGCGTAG GAGTCGCGCA GCTTGTTGAC 
CAGCTCGGCG GTGACCTGCA CGTCTAGGGC GCAGTAGTCC AGGGTTTCCT TGATGATGTC 
ATACTTATCC TGTCCCTTTT TTTTCCACAG CTCGCGGTTG AGGACAAACT CTTCGCGGTC 
TTTCCAGTAC TCTTGGATCG GAAACCCGTC GGCCTCCGAA CGGTAAGAGC CTAGCATGTA 
GAACTGGTTG ACGGCCTGGT AGGCGCAGCA TCCCTTTTCT ACGGGTAGCG CGTATGCCTG 726 0 

CGCGGCCTTC CGGAGCGAGG TGTGGGTGAG CGCAAAGGTG TCCCTGACCA TGACTTTGAG 73 2 0 

GTACTGGTAT TTGAAGTCAG TGTCGTCGCA TCCGCCCTGC TCCCAGAGCA AAAAGTCCGT 7380 
GCGCTTTTTG GAACGCGGAT TTGGCAGGGC GAAGGTGACA TCGTTGAAGA GTATCTTTCC 744 0 

CGCGCGAGGC ATAAAGTTGC GTGTGATGCG GAAGGGTCCC GGCACCTCGG AACGGTTGTT 7 50 0 
AATTACCTGG GCGGCGAGCA CGATCTCGTC AAAGCCGTTG ATGTTGTGGC CCACAATGTA 
AAGTTCC.AAG AAGCGCGGGA TGCCCTTGAT GGAAGGCAAT TTTTTAAGTT CCTCGTAGGT 
GAGCTCTTCA GGGGAGCTGA GCCCGTGCTC TGAAAGGGCC CAGTCTGCAA GATGAGGGTT 
GGAAGCGACG AATGAGCTCC ACAGGTCACG GGCCATTAGC ATTTGCAGGT GGTCGCGAAA 7 74 0 

GGTCCTAAAC TGGCGACCTA TGGCCATTTT TTCTGGGGTG ATGCAGTAGA AGGTAAGCGG 7800 
GTCTTG7TCC CAGCGGTCCC ATCCAAGGTT CGCGGCTAGG TCTCGCGCGG CAGTCACTAG 786 0 

AGGCTCATCT CCGCCGAACT TCATGACCAG CATGAAGGGC ACGAGCTGCT TCCCAAAGGC 7 920 

CCCCATCCAA GTATAGGTCT CTACATCGTA GGTGACAAAG AGACGCTCGG TGCGAGGATG 7 9 80 

CGAGCCGATC GGGAAGAACT GGATCTCCCG CCACCAATTG GAGGAGTGGC TATTGATGTG 8040 
GTGAAAGTAG AAGTCCCTGC GACGGGCCGA ACACTCGTGC TGGCTTTTGT AAAAACGTGC 
GCAGTACTGG CAGCGGTGCA CGGGCTGTAC ATCCTGCACG AGGTTGACCT GACGACCGCG 
CACAAGGAAG CAGAGTGGGA ATTTGAGCCC CTCGCCTGGC GGGTTTGGCT GGTGGTCTTC 
TACT7CGGCT GCTTGTCCTT GACCGTCTGG CTGCTCGAGG GGAGTTACGG TGGATCGGAC 
CACCACGCCG CGCGAGCCCA AAGTCCAGAT GTCCGCGCGC GGCGGTCGGA GCTTGATGAC 
AACATCGCGC AGATGGGAGC TGTCCATGGT CTGGAGCTCC CGCGGCGTCA GGTCAGGCGG 84 00 

GAGCTCCTGC AGGTTTACCT CGCATAGACG GGTCAGGGCG CGGGCTAGAT CCAGGTGATA 8460 
CCTAATTTCC AGGGGCTGGT TGGTGGCGGC GTCGATGGCT TGCAAGAGGC CGCATCCCCG 8 520 

CGGCGCGACT ACGGTACCGC GCGGCGGGCG GTGGGCCGCG GGGGTGTCCT TGGATGATGC 8 580 

ATCTAAAAGC GGTGACGCGG GCGAGCCCCC GGAGGTAGGG GGGGCTCCGG ACCCGCCGGG 
AGAGGGGGCA GGGGCACGTC GGCGCCGCGC GCGGGCAGGA GCTGGTGCTG CGCGCGTAGG 
TTGCTGGCGA ACGCGACGAC GCGGCGGTTG ATCTCCTGAA TCTGGCGCCT CTGCGTGAAG 
ACGACGGGCC CGGTGAGCTT GAGCCTGAAA GAGAGTTCGA CAGAATCAAT TTCGGTGTCG 8 8 20 

TTGACGGCGG CCTGGCGCAA AATCTCCTGC ACGTCTCCTG AGTTGTCTTG ATAGGCGATC 8880 
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TCGGCCATGA 

GTGGCGGCGA 
TCGTTCCAGA 
TGCGCGAGAT 
AGGTAGTTGA 
AACGTGGATT 
ACGGCGAAGT 
CGGATGAGCT 
TCTTCTTCAA 
GGAGGGGGGA 
ATCTCCCCGC 
AGTTGGAAGA 
AGGGATACGG 
GACCTGAGCG 
TCACAGTCGC 
TTTCTGGCGG 
GTCGACAGAA 
CCCCAGGCTT 
ACCGGCACTT 
GCGGCGGAGT 
CTCATCGGCT 
ACCTGCGTGA 
TTGATGGTGT 
GAGAGCTCGG 
GTCCGCACCA 
CAGCGTAGGG 
TAGATGTACC 
CGGACGCGGT 
CCGGTCAGGC 
GGCACTCTTC 
TCGAGCCCCG 
CAGGTGTGCG 
GCTGCTGCGC 
GCGAAAGCAT 
GCGGGACCCC 



ACTGCTC^?^ 
GGTCGTTGGA 
CGCGGCTGTA 
TGAGCTCCAC 
GGGTGGTGGC 
CGTTGATATC 
TGAAAAACTG 
CGGCGACAGT 
TCTCCTCTTC 
CACGGCGGCG 
GGCGACGGCG 
CGCCGCCCGT 
CGCTAACGAT 
AGTCCGCATC 
AAGGTAGGCT 
AGGTGCTGCT 
GCACCATGTC 
CGTTTTGACA 
CTTCTTCTCC 
TTGGCCGTAG 
GAAGCAGGGC 
GGGTAGACTG 
AAGTGCAGTT 
TGTACCTGAG 
GGTACTGGTA 
TGGCCGGGGC 
TGGACATCCA 
TCCAGATGTT 
GCGCGCAATC 
CGTGGTCTGG 
TATCCGGCCG 
ACGTCAGACA 
TAGCTTTTTT 
TAAGTGGCTC 
CGGTTCGAGT 



CTCTTCCTCC 
AATGCGGGCC 
GACCACGCCC 
GTGCCGGGCG 
GGTGTGTTCT 
CCCCAAGGCC 
GGAGTTGCGC 
GTCGCGCACC 
CATAAGGGCC 
ACGACGGCGC 
CATGGTCTCG 
CATGTCCCGG 
GCATCTCAAC 
GACCGGATCG 
GAGCACCGTG 
GATGATGTAA 
CTTGGGTCCG 
TCGGCGCAGG 
TTCCTCTTGT 
GTGGCGCCCT 
TAGGTCGGCG 
GAAGTCATCC 
GGCCATAACG 
ACGCGAGTAA 
TCCCACCAAA 
TCCGGGGGCG 
GGTGATGCCG 
GCGCAGCGGC 
GTTGACGCTC 
TGGATAAATT 
TCCGCCGTGA 
ACGGGGGAGT 
GGCCACTGGC 
GCTCCCTGTA 
CTCGGACCGG 



TGGAGATCTC 
ATGAGCTGCG 
CCTTCGGCAT 
AAGACGGCGT 
GCCACGAAGA 
TCAAGGCGCT 
GCCGACACGG 
TCGCGCTCAA 
TCCCCTTCTT 
ACCGGGAGGC 
GTGACGGCGC 
TTATGGGTTG 
AATTGTTGTG 
GAAAACCTCT 
GCGGGCGGCA 
TTAAAGTAGG 
GCCTGCTGAA 
TCTTTGTAGT 
CCTGCATCTC 
CTTCCTCCCA 
ACAACGCGCT 
ATGTCCACAA 
GACCAGTTAA 
GCCCTCGAGT 
AAGTGCGGCG 
AGATCTTCCA 
GCGGCGGTGG 
AAAAAGTGCT 
TAGACCGTGC 
CGCAAGGGTA 
TCCATGCGGT 
GCTCCTTTTG 
CGCGCGCAGC 
GCCGGAGGGT 
CCGGACTGCG 



CGCGTCCGGC 
AGAAGGCGTT 
CGCGGGCGCG 
AGTTTCGCAG 
AGTACATAAC 
CCATGGCCTC 
TTAACTCCTC 
AGGCTACAGG 
CTTCTTCTGG 
GGTCGACAAA 
GGCCGTTCTC 
GCGGGGGGCT 
TAGGTACTCC 
CGAGAAAGGC 
GCGGGCGGCG 
CGGTCTTGAG 
TGCGCAGGCG 
AGTCTTGCAT 
TTGCATCTAT 
TGCGTGTGAC 
CGGCTAATAT 
AGCGGTGGTA 
CGGTCTGGTG 
CAAATACGTA 
GCGGCTGGCG 
ACATAAGGCG 
TGGAGGCGCG 
CCATGGTCGG 
AAAAGGAGAG 
TCATGGCGGA 
TACCGCCCGC 
GCTTCCTTCC 
GTAAGCGGTT 
TATTTTCCAA 
GCGAACGGGG 
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TCGCTCCACG 


8940 


GAGGCCTCCC 


9000 


CATGACCACC 


9060 


GCGCTGAAAG 


9120 


CCAGCGTCGC 


9180 


GTAGAAGTCC 


9240 


CTCCAGAAGA 


9300 


GGCCTCTTCT 


9360 


CGGCGGTGGG 


9420 


GCGCTCGATC 


9480 


GCGGGGGCGC 


9540 


GCCATGCGGC 


9600 


GCCGCCGAGG 


9660 


GTCTAACCAG 


9720 


GTCGGGGTTG 


9780 


ACGGCGGATG 


9840 


GTCGGCCATG 


9900 


GAGCCTTTCT 


9960 


CGCTGCGGCG 


10020 


CCCGAAGCCC 


10080 


GGCCTGCTGC 


10140 


TGCGCCCGTG 


10200 


ACCCGGCTGC 


10260 


GTCGTTGC7VA 


10320 


GTAGAGGGGC 


10380 


ATGATATCCG 


10440 


CGGAAAGTCG 


10500 


GACGCTCTGG 


10560 


CCTGTAAGCG 


10620 


CGACCGGGGT 


10680 


GTGTCGAACC 


10740 


AGGCGCGGCG 


10800 


AGGCTGGAAA 


10860 


GGGTTGAGTC 


10920 


GTTTGCCTCC 


10980 
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11040 
11100 
lllSO 
11220 
11280 
11340 
11400 
11460 



CCGTCATGCA AGACC^^T TGCAAATTCC TCCGGAAACA GGGACGAGCC CCTTTTTTGC 
TTTTCCCAGA TGCATCCGGT GCTGCGGCAG ATGCGCCCCC CTCCTCAGCA GCGGCAAGAG 
CAAGAGCAGC GGCAGACATG CAGGGCACCC TCCCCTCCTC CTACCGCGTC AGGAGGGGCG 
ACATCCGCGG TTGACGCGGC AGCAGATGGT GATTACGAAC CCCCGCGGCG CCGGGCCCGG 
CACTACCTGG ACTTGGAGGA GGGCGAGGGC CTGGCGCGGC TAGGAGCGCC CTCTCCTGAG 
CGGTACCCAA GGGTGCAGCT GAAGCGTGAT ACGCGTGAGG CGTACGTGCC GCGGCAGAAC 
CTGTTTCGCG ACCGCGAGGG AGAGGAGCCC GAGGAGATGC GGGATCGAAA GTTCCACGCA 
GGGCGCGAGC TGCGGCATGG CCTGAATCGC GAGCGGTTGC TGCGCGAGGA GGACTTTGAG 

CCCGACGCGC GAACCGGGAT TAGTCCCGCG CGCGCACACG TGGCGGCCGC CGACCTGGTA 11520 

ACCGCATACG AGCAGACGGT GAACCAGGAG ATTAACTTTC AAAAAAGCTT TAACAACCAC 11580 

GTGCGTACGC TTGTGGCGCG CGAGGAGGTG GCTATAGGAC TGATGCATCT GTGGGACTTT 11640 

GTAAGCGCGC TGGAGCAAAA CCCAAATAGC AAGCCGCTCA TGGCGCAGCT GTTCCTTATA 11700 

GTGCAGCACA GCAGGGACAA CGAGGCATTC AGGGATGCGC TGCTAAACAT AGTAGAGCCC 11760 

GAGGGCCGCT GGCTGCTCGA TTTGATAAAC ATCCTGCAGA GCATAGTGGT GCAGGAGCGC 11820 

AGCTTGAGCC TGGCTGACAA GGTGGCCGCC ATCAACTATT CCATGCTTAG CCTGGGCAAG 11880 

TTTTACGCCC GCAAGATATA CCATACCCCT TACGTTCCCA TAGACAAGGA GGTAAAGATC 11940 

GAGGGGTTCT ACATGCGCAT GGCGCTGAAG GTGCTTACCT TGAGCGACGA CCTGGGCGTT 12000 

TATCGC.:^CG AGCGCATCCA CAAGGCCGTG AGCGTGAGCC GGCGGCGCGA GCTCAGCGAC 12060 

CGCGAGCTGA TGCACAGCCT GCAAAGGGCC CTGGCTGGCA CGGGCAGCGG CGATAGAGAG 12120 

GCCGAGTCCT ACTTTGACGC GGGCGCTGAC CTGCGCTGGG CCCCAAGCCG ACGCGCCCTG 12180 

GAGGCAGCTG GGGCCGGACC TGGGCTGGCG GTGGCACCCG CGCGCGCTGG CAACGTCGGC 12240 

GGCGTGGAGG AATATGACGA GGACGATGAG TACGAGCCAG AGGACGGCGA GTACTAAGCG 12300 

GTGATGTTTC TGATCAGATG ATGCAAGACG CAACGGACCC GGCGGTGCGG GCGGCGCTGC 12360 

AGAGCCAGCC GTCCGGCCTT AACTCCACGG ACGACTGGCG CCAGGTCATG GACCGCATCA 12420 

TGTCGCTGAC TGCGCGCAAT CCTGACGCGT TCCGGCAGCA GCCGCAGGCC AACCGGCTCT 12480 

CCGCAATTCT GGAAGCGGTG GTCCCGGCGC GCGCAAACCC CACGCACGAG AAGGTGCTGG 12540 

CGATCGTAAA CGCGCTGGCC GAAAACAGGG CCATCCGGCC CGACGAGGCC GGCCTGGTCT 12600 

ACGACGCGCT GCTTCAGCGC GTGGCTCGTT ACAACAGCGG CAACGTGCAG ACCAACCTGG 12660 

ACCGGCTGGT GGGGGATGTG CGCGAGGCCG TGGCGCAGCG TGAGCGCGCG CAGCAGCAGG 12720 

GCAACCTGGG CTCCATGGTT GCACTAAACG CCTTCCTGAG TACACAGCCC GCCAACGTGC 12780 

CGCGGGGACA GGAGGACTAC ACCAACTTTG TGAGCGCACT GCGGCTAATG GTGACTGAGA 12 84 0 

CACCGCAAAG TGAGGTGTAC CAGTCTGGGC CAGACTATTT TTTCCAGACC AGTAGACAAG 12 900 

GCCTGCAGAC CGTAAACCTG AGCCAGGCTT TCAAAAACTT GCAGGGGCTG TGGGGGGTGC 12960 

GGGCTCCCAC AGGCGACCGC GCGACCGTGT CTAGCTTGCT GACGCCCAAC TCGCGCCTGT 13 02 0 

TGCTGCTGCT AATAGCGCCC TTCACGGACA GTGGCAGCGT GTCCCGGGAC ACATACCTAG 13080 
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13140 



TCCAGGAGAT TACAAGTGTC AGCCGCGCGC TGGGGCAGGA GGACACGGGC AGCCTGGAGG 13200 

CAACCCTAAA CTACCTGCTG ACCAACCGGC GGCAGAAGAT CCCCTCGTTG CACAGTTTAA 13 26 0 

ACAGCGAGGA GGAGCGCATT TTGCGCTACG TGCAGCAGAG CGTGAGCCTT AACCTGATGC 13320 

GCGACGGGGT AACGCCCAGC GTGGCGCTGG ACATGACCGC GCGCAACATG GAACCGGGCA 13 3 80 

TGTATGCCTC AAACCGGCCG TTTATCAACC GCCTAATGGA CTACTTGCAT CGCGCGGCCG 134 4 0 

CCGTGAACCC CGAGTATTTC ACCAATGCCA TCTTGAACCC GCACTGGCTA CCGCCCCCTG 13 500 

GTTTCTACAC CGGGGGATTC GAGGTGCCCG AGGGTAACGA TGGATTCCTC TGGGACGACA 13 560 

TAGACGACAG CGTGTTTTCC CCGCAACCGC AGACCCTGCT AGAGTTGCAA CAGCGCGAGC 13 620 

AGGCAGAGGC GGCGCTGCGA AAGGAAAGCT TCCGCAGGCC AAGCAGCTTG TCCGATCTAG 136 8 0 

GCGCTGCGGC CCCGCGGTCA GATGCTAGTA GCCCATTTCC AAGCTTGATA GGGTCTCTTA 13 74 0 

CCAGCACTCG CACCACCCGC CCGCGCCTGC TGGGCGAGGA GGAGTACCTA AACAACTCGC 13 80 0 

TGCTGCAGCC GCAGCGCGAA AAAAACCTGC CTCCGGCATT TCCCAACAAC GGGATAGAGA 13 860 

GCCTAGTGGA CAAGATGAGT AGATGGAAGA CGTACGCGCA GGAGCACAGG GACGTGCCAG 13 92 0 

GCCCGCGCCC GCCCACCCGT CGTCAAAGGC ACGACCGTCA GCGGGGTCTG GTGTGGGAGG 13 980 

ACGATGACTC GGCAGACGAC AGCAGCGTCC TGGATTTGGG AGGGAGTGGC AACCCGTTTG 14 040 

CGCACCTTCG CCCCAGGCTG GGGAGAATGT TTTAAAAAAA AAAAAGCATG ATGCAAAATA 1410 0 

AAAAACTCAC CAAGGCCATG GCACCGAGCG TTGGTTTTCT TGTATTCCCC TTAGTATGCG 1416 0 

GCGCGCGGCG ATGTATGAGG AAGGTCCTCC TCCCTCCTAC GAGAGTGTGG TGAGCGCGGC 14 22 0 

GCCAGTGGCG GCGGCGCTGG GTTCTCCCTT CGATGCTCCC CTGGACCCGC CGTTTGTGCC 1428 0 

TCCGCGGTAC CTGCGGCCTA CCGGGGGGAG AAACAGCATC CGTTACTCTG AGTTGGCACC 14 34 0 

CCTATTCGAC ACCACCCGTG TGTACCTGGT GGACAACAAG TCAACGGATG TGGCATCCCT 14400 

GAACTACCAG AACGACCACA GCAACTTTCT GACCACGGTC ATTCAAAACA ATGACTACAG 144 6 0 

CCGGGGGGAG GCAAGCACAC AGACCATCAA TCTTGACGAC CGGTCGCACT GGGGCGGCGA 14 52 0 

CCTGAAAACC ATCCTGCATA CCAACATGCC AAATGTGAAC GAGTTCATGT TTACCAATAA 14 58 0 

GTTTAAGGCG CGGGTGATGG TGTCGCGCTT GCCTACTAAG GACAATCAGG TGGAGCTGAA 14 64 0 

ATACGAGTGG GTGGAGTTCA CGCTGCCCGA GGGCAACTAC TCCGAGACCA TGACCATAGA 14 70 0 

CCTTATGAAC AACGCGATCG TGGAGCACTA CTTGAAAGTG GGCAGACAGA ACGGGGTTCT 14 76 0 

GGAAAGCGAC ATCGGGGTAA AGTTTGACAC CCGCAACTTC AGACTGGGGT TTGACCCCGT 14 820 

CACTGGTCTT GTCATGCCTG GGGTATATAC AAACGAAGCC TTCCATCCAG ACATCATTTT 14 88 0 

GCTGCCAGGA TGCGGGGTGG ACTTCACCCA CAGCCGCCTG AGCAACTTGT TGGGCATCCG 14 94 0 

CAAGCGGCAA CCCTTCCAGG AGGGCTTTAG GATCACCTAC GATGATCTGG AGGGTGGTAA 1500 0 

CATTCCCGCA CTGTTGGATG TGGACGCCTA CCAGGCGAGC TTGAAAGATG ACACCGAACA 1506 0 

GGGCGGGGGT GGCGCAGGCG GCAGCAACAG CAGTGGCAGC GGCGCGGAAG AGAACTCCAA 1512 0 

CGCGGCAGCC GCGGCAATGC AGCCGGTGGA GGACATGAAC GATCATGCCA TTCGCGGCGA 15180 
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C>.CC..TG^C kCACGG^R AGGAGAAGCG CGCTGAGGCC GAAGCAG^^ CCGAAGCTGC 1524 0 

CGCCCCCOCr GCGCAACCCG AGGTCGAGAA GCCTCAGAAG AAACCGGTGA TCAAACCCCT 15300 

GACAGAGGAC AGCAAGAAAC GCAGTTACAA CCTAATAAGC AATGACAGCA CCTTCACCCA 15360 

GTACCGCAGC TGGTACCTTG CATACAACTA CGGCGACCCT CAGACCGGAA TCCGCTCATG 15420 

GACCCTGCTT TGCACTCCTG ACGTAACCTG CGGCTCGGAG CAGGTCTACT GGTCGTTGCC 15480 

AGACATGATG CAAGACCCCG TGACCTTCCG CTCCACGCGC CAGATCAGCA ACTTTCCGGT 15 540 

GGTGGGCGCC GAGCTGTTGC CCGTGCACTC CAAGAGCTTC TACAACGACC AGGCCGTCTA 15600 

CTCCCAACTC ATCCGCCAGT TTACCTCTCT GACCCACGTG TTCAATCGCT TTCCCGAGAA 15660 

CCAGATTTTG GCGCGCCCGC CAGCCCCCAC CATCACCACC GTCAGTGAAA ACGTTCCTGC 15720 

TCTCACAGAT CACGGGACGC TACCGCTGCG CAACAGCATC GGAGGAGTCC AGCGAGTGAC 15780 

CATTACTGAC GCCAGACGCC GCACCTGCCC CTACGTTTAC AAGGCCCTGG GCATAGTCTC 15 840 

GCCGCGCGTC CTATCGAGCC GCACTTTTTG AGCAAGCATG TCCATCCTTA TATCGCCCAG 15900 

CAATAACACA GGCTGGGGCC TGCGCTTCCC AAGCAAGATG TTTGGCGGGG CCAAGAAGCG 15 960 

CTCCGACC.^ CACCCAGTGC GCGTGCGCGG GCACTACCGC GCGCCCTGGG GCGCGCACAA 1602 0 

ACGCGGCCGC ACTGGGCGCA CCACCGTCGA TGACGCCATC GACGCGGTGG TGGAGGAGGC 16080 

GCGCAACTAC ACGCCCACGC CGCCACCAGT GTCCACAGTG GACGCGGCCA TTCAGACCGT 1614 0 

GGTGCGCGGA GCCCGGCGCT ATGCTAAAAT GAAGAGACGG CGGAGGCGCG TAGCACGTCG 16200 

CCACCGCCGC CGACCCGGCA CTGCCGCCCA ACGCGCGGCG GCGGCCCTGC TTAACCGCGC 16260 

ACGTCGCACC GGCCGACGGG CGGCCATGCG GGCCGCTCGA AGGCTGGCCG CGGGTATTGT 16 32 0 

CACTGTGCCC CCCAGGTCCA GGCGACGAGC GGCCGCCGCA GCAGCCGCGG CCATTAGTGC 16380 

TATGACTCAG GGTCGCAGGG GCAACGTGTA TTGGGTGCGC GACTCGGTTA GCGGCCTGCG 16 44 0 

CGTGCCCGTG CGCACCCGCC CCCCGCGCAA CTAGATTGCA AGAAAAAACT ACTTAGACTC 16 500 

GTACTGTTGT ATGTATCCAG CGGCGGCGGC GCGCAACGAA GCTATGTCCA AGCGCAAAAT 16 56 0 

CAAAGAAGAG ATGCTCCAGG TCATCGCGCC GGAGATCTAT GGCCCCCCGA AGAAGGAAGA 16620 

GCAGGATTAC AAGCCCCGAA AGCTAAAGCG GGTCAAAAAG AAAAAGAAAG ATGATGATGA 16680 

TGAACTTGAC GACGAGGTGG AACTGCTGCA CGCTACCGCG CCCAGGCGAC GGGTACAGTG 16740 

GAAAGGTCGA CGCGTAAAAC GTGTTTTGCG ACCCGGCACC ACCGTAGTCT TTACGCCCGG 16800 

TGAGCGCTCC ACCCGCACCT ACAAGCGCGT GTATGATGAG GTGTACGGCG ACGAGGACCT 16860 

GCTTGAGCAG GCCAACGAGC GCCTCGGGGA GTTTGCCTAC GGAAAGCGGC ATAAGGACAT 16920 

GCTGGCGTTG CCGCTGGACG AGGGCAACCC AACACCTAGC CTAAAGCCCG TAACACTGCA 16980 

GCAGGTGCTG CCCGCGCTTG CACCGTCCGA AGAAAAGCGC GGCCTAAAGC GCGAGTCTGG 17040 

TGACTTGGCA CCCACCGTGC AGCTGATGGT ACCCAAGCGC CAGCGACTGG AAGATGTCTT 17100 

GGAAAAAATG ACCGTGGAAC CTGGGCTGGA GCCCGAGGTC CGCGTGCGGC CAATCAAGCA 17160 

GGTGGCGCCG GGACTGGGCG TGCAGACCGT GGACGTTCAG ATACCCACTA CCAGTAGCAC 17220 

CAGTATTGCC ACCGCCACAG AGGGCATGGA GACACAAACG TCCCCGGTTG CCTCAGCGGT 172 80 
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GGCGGATCCC GCGGTgWK CGGTCGCTGC GGCCGCGTCC AAGACC^Wf CGGAGGTGCA 1734 0 

AACGGACCCG TGGATGTTTC GCGTTTCAGC CCCCCGGCGC CCGCGCGGTT CGAGGAAGTA 17400 

CGGCGCCGCC AGCGCGCTAC TGCCCGAATA TGCCCTACAT CCTTCCATTG CGCCTACCCC 1746 0 

CGGCTATCGT GGCTACACCT ACCGCCCCAG AAGACGAGCA ACTACCCGAC GCCGAACCAC 17 520 

CACTGGAACC CGGCGCCGCC GTCGCCGTCG CCAGCCCGTG CTGGCCCCGA TTTCCGTGCG 17 580 

CAGGGTGGCT CGCGAAGGAG GCAGGACCCT GGTGCTGCCA ACAGCGCGCT ACCACCCCAG 1764 0 

CATCGTTTAA AAGCCGGTCT TTGTGGTTCT TGCAGATATG GCCCTCACCT GCCGCCTCCG 1770 0 

TTTCCCGGTG CCGGGATTCC GAGGAAGAAT GCACCGTAGG AGGGGCATGG CCGGCCACGG 17 760 

CCTGACGGGC GGCATGCGTC GTGCGCACCA CCGGCGGCGG CGCGCGTCGC ACCGTCGCAT 17 820 

GCGCGGCGGT ATCCTGCCCC TCCTTATTCC ACTGATCGCC GCGGCGATTG GCGCCGTGCC 1788 0 

CGGAATTGCA TCCGTGGCCT TGCAGGCGCA GAGACACTGA TTAAAAACAA GTTGCATGTG 17 940 

GAAAAATCAA AATAAAAAGT CTGGACTCTC ACGCTCGCTT GGTCCTGTAA CTATTTTGTA 18 000 

GAATGGAAGA CATCAACTTT GCGTCTCTGG CCCCGCGACA CGGCTCGCGC CCGTTCATGG 18 060 

GAAACTGGCA AGATATCGGC ACCAGCAATA TGAGCGGTGG CGCCTTCAGC TGGGGCTCGC 1812 0 

TGTGGAGCGG CATTAAAAAT TTCGGTTCCA CCGTTAAGAA CTATGGCAGC AAGGCCTGGA 1818 0 

ACAGCAGCAC AGGCCAGATG CTGAGGGATA AGTTGAAAGA GCAAAATTTC CAACAAAAGG 1824 0 

TGGTAGATGG CCTGGCCTCT GGCATTAGCG GGGTGGTGGA CCTGGCCAAC CAGGCAGTGC 183 00 

AAAATAAGAT TAACAGTAAG CTTGATCCCC GCCCTCCCGT AGAGGAGCCT CCACCGGCCG 18 360 

TGGAGACAGT GTCTCCAGAG GGGCGTGGCG AAAAGCGTCC GCGCCCCGAC AGGGAAGAAA 18420 

CTCTGGTGAC GCAAATAGAC GAGCCTCCCT CGTACGAGGA GGCACTAAAG CAAGGCCTGC 18480 

CCACCACCCG TCCCATCGCG CCCATGGCTA CCGGAGTGCT GGGCCAGCAC ACACCCGTAA 18 540 

CGCTGGACCT GCCTCCCCCC GCCGACACCC AGCAGAAACC TGTGCTGCCA GGCCCGACCG 18600 

CCGTTGTTGT AACCCGTCCT AGCCGCGCGT CCCTGCGCCG CGCCGCCAGC GGTCCGCGAT 1866 0 

CGTTGCGGCC CGTAGCCAGT GGCAACTGGC AAAGCACACT GAACAGCATC GTGGGTCTGG 1872 0 

GGGTGCAATC CCTGAAGCGC CGACGATGCT TCTGAATAGC TAACGTGTCG TATGTGTGTC 1878 0 

ATGTATGCGT CCATGTCGCC GCCAGAGGAG CTGCTGAGCC GCCGCGCGCC CGCTTTCCAA 18 840 

GATGGCTACC CCTTCGATGA TGCCGCAGTG GTCTTACATG CACATCTCGG GCCAGGACGC 18 900 

CTCGGAGTAC CTGAGCCCCG GGCTGGTGCA GTTTGCCCGC GCCACCGAGA CGTACTTCAG 18 960 

CCTGAATAAC AAGTTTAGAA ACCCCACGGT GGCGCCTACG CACGACGTGA CCACAGACCG 1902 0 

GTCCCAGCGT TTGACGCTGC GGTTCATCCC TGTGGACCGT GAGGATACTG CGTACTCGTA 1908 0 

CAAGGCGCGG TTCACCCTAG CTGTGGGTGA TAACCGTGTG CTGGACATGG CTTCCACGTA 1914 0 

CTTTGACATC CGCGGCGTGC TGGACAGGGG CCCTACTTTT AAGCCCTACT CTGGCACTGC 19200 

CTACAACGCC CTGGCTCCCA AGGGTGCCCC AAATCCTTGC GAATGGGATG AAGCTGCTAC 19260 

TGCTCTTGAA ATAAACCTAG AAGAAGAGGA CGATGACAAC GAAGACGAAG TAGACGAGCA 19320 

AGCTGAGCAG CAAAAAACTC ACGTATTTGG GCAGGCGCCT TATTCTGGTA TAAATATTAC 19380 
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AAAGGAGGGT 


attca^Wkg 

*^ X X Vk* VJ 


' O i 1 CCAAVjtj 


I TCAAACACCT 


' AAATATGCCG ATAAAACATT 


1944 0 


TCAAC^TGAA 


pr"rp A i a T a. n 


(jACjAATCTCA 


GTGGTACGAA 


. ACTGAAATTA ATCATGCAGC 


19500 


TGGGAGAGTP 




ClACCCCAAT 


GAAACCATGT 


' TACGGTTCAT 


' ATGCAAAACC 


19560 


CACAAATGAA 




aaggcattct 


TGTAAAGCAA 


CAAAATGGAA 


. AGCTAGAAAG 


19620 


TCAAGTGGAA 


A TCI r* 21 a XT'TT' 


i c rCAACTAC 


TGAGGCGACC 


GCAGGCAATG 


GTGATAACTT 


19680 


GACTCCTAAA 


X i 1 i <>j i 


alagtgaaga 


TGTAGATATA 


GAAACCCCAG 


ACACTCATAT 


19740 






aggaaggtaa 


CTCACGAGAA 


CTAATGGGCC 


AACAATCTAT 


19800 


GCCCAACAGG 


CPT A A TT a a 


i ILjCTTTTAG 


GGACAATTTT 


ATTGGTCTAA 


TGTATTACAA 


19860 


CAGCACGGGT 


AAT A TCiCZrzTr' 


i i C i GGCGGG 


CCAAGCATCG 


CAGTTGAATG 


CTGTTGTAGA 


19920 


TTTGCAAGAC 


AGAAAPapnr: 


Auc I r rcATA 


CCAGCTTTTG 


CTTGATTCCA 


TTGGTGATAG 


19980 


AACCAGGTAC 


X X i XV„ii-ixOi 


UUAATCAGGC 


TGTTGACAGC 


TATGATCCAG 


ATGTTAGAAT 


20040 


TATTGAAAAT 




AAGATGAACT 


TCCAAATTAC 


TGCTTTCCAC 


TGGGAGGTGT 


20100 


GATTAATACA 


A^J 2ir'T(^TTn 


/^<^A A /~< TV 7N Is 

c C AAGGT AAA 


ACCTAAAACA 


GGTCAGGAAA 


ATGGATGGGA 


20160 


AAAAGATGPT 


rtV„>\VjM>\l i i i 


CAGATAAAAA 


TGAAATAAGA 


GTTGGAAATA 


ATTTTGCCAT 


20220 


GGAAATCAAT 




ACCTGTGGAG 


AAATTTCCTG 


TACTCCAACA 


TAGCGCTGTA 


20280 


TTTGCCCGAC 


A An r^T a a a r—T 


ACAGTCCTTC 


CAACGTAAAA 


ATTTCTGATA 


ACCCAAACAC 


20340 


CTACGACTAC 




GAGTGGTGGC 


TCCCGGGTTA 


GTGGACTGCT 


ACATTAACCT 


20400 


TGGAGCACGC 


T* n TT^ T* 


ACTATATGGA 


CAACGTCAAC 


CCATTTAACC 


ACCACCGCAA 


20460 


TGCTGGCPTr; 




CAATGTTGCT 


GGGCAATGGT 


CGCTATGTGC 


CCTTCCACAT 


20520 


CCAGGTGCCT 




TTGCCATTAA 


AAACCTCCTT 


CTCCTGCCGG 


GCTCATACAC 


20580 


CTACGAGTGG 


A a r^T"T^CTs.r* a 


AGGATGTTAA 


CATGGTTCTG 


CAGAGCTCCC 


TAGGAAATGA 


20640 


CCTAAGGGTT 


n a r^rr' n/^cT^'^ 

*j>\Vw OLj AVj 1_ A 


GCATTAAGTT 


TGATAGCATT 


TGCCTTTACG 


CCACCTTCTT 


20700 


CCCCATGGCC 




CCTCCACGCT 


TGAGGCCATG 


CTTAGAAACG 


ACACCAACGA 


20760 


CCAGTCCTTT 

X ^ Vim X X X 


AML_ AC i A i t_ 


TCTCCGCCGC 


CAACATGCTC 


TACCCTATAC 


CCGCCAACGC 


20820 


TACCAAffiTn 


C U UM i A i C 


TCCCCTCCCG 


CAACTGGGCG 


GCTTTCCGCG 


GCTGGGCCTT 


20880 


CACGCGCCTT 


AAVj AV_ 1 AAVjCj 


AAACCCCATC 


ACTGGGCTCG 


GGCTACGACC 


CTTATTACAC 


20940 


CTACTCTGG C 


T* T a T* J\ /^T 
i C i A i ACCC i 


ACCTAGATGG 


AACCTTTTAC 


CTCAACCACA 


CCTTTAAGAA 


21000 


GGTGGCCATT 


AV- L. 1 i 1 At_ 1 


CTTCTGTCAG 


CTGGCCTGGC 


AATGACCGCC 


TGCTTACCCC 


21060 


CAACGAflTTT 


LjAAAI lAAGC 


GCTCAGTTGA 


CGGGGAGGGT 


TACAACGTTG 


CCCAGTGTAA 


21120 


CATGACCAAA 


^ AC i V^iij i i C C 


TGGTACAAAT 


GCTAGCTAAC 


TACAACATTG 


GCTACCAGGG 


21180 


CTTCTATATr 


C C A(j AVjAL» C T 


ACAAGGACCG 


CATGTACTCC 


TTCTTTAGAA 


ACTTCCAGCC 


21240 


CATGAGCCGT 


C AoVj i Cilj i OVj 


ATGATACTAA 


ATACAAGGAC 


TACCAACAGG 


TGGGCATCCT 


21300 


ACACCAACAC 


AAC AAC 1 C 1 G 


GATTTGTTGG 


CTACCTTGCC 


CCCACCATGC 


GCGAAGGACA 


21360 


GGCCTACCCT 


r^r*"Fa a (^T'Tr^r^ 

'JClAACi iCC 


CCTATCCGCT 


TATAGGCAAG 


ACCGCAGTTG 


ACAGCATTAC 


21420 




TTTCTTTGCG 


ATCGCACCCT 


TTGGCGCATC 


CCATTCTCCA 


GTAACTTTAT 


21480 
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GTCCATGGGC 


GCACTCi?W?G 


ACCTGGGCCA 


AAACCTTCTC 


tacgcca!^' 


CCGCCCACGC 


21540 


GCTAGACATG 


ACTTTTGAGG 


TGGATCCCAT 


GGACGAGCCC 


ACCCTTCTTT 


ATGTTTTGTT 


21600 


TGAAGTCTTT 


GACGTGGTCC 


GTGTGCACCG 


GCCGCACCGC 


ggcgtcatcg 


AAACCGTGTA 


21660 


CCTGCGCACG 


CCCTTCTCGG 


CCGGCAACGC 


CACAACATAA 


agaagcaagc 


.AACATCAACA 


21720 


ACAGCTGCCG 


CCATGGGCTC 


CAGTGAGCAG 


GAACTGAAAG 


ccattgtcaa 


AGATCTTGGT 


21780 


TGTGGGCCAT 


ATTTTTTGGG 


CACCTATGAC 


AAGCGCTTTC 


caggctttgt 


TTCTCCACAC 


21840 


AAGCTCGCCT 


GCGCCATAGT 


CAATACGGCC 


GGTCGCGAGA 


ctgggggcgt 


ACACTGGATG 


21900 


GCCTTTGCCT 


GGAACCCGCA 


CTCAAAAACA 


TGCTACCTCT 


TTGAGCCCTT 


TGGCTTTTCT 


21960 


GACCAGCGAC 


TCAAGCAGGT 


TTACCAGTTT 


GAGTACGAGT 


CACTCCTGCG 


CCGTAGCGCC 


22020 


ATTGCTTCTT 


CCCCCGACCG 


CTGTATAACG 


CTGGAAAAGT 


CCACCCAAAG 


CGTACAGGGG 


22080 


CCCAACTCGG 


CCGCCTGTGG 


ACTATTCTGC 


TGCATGTTTC 


TCCACGCCTT 


TGCCAACTGG 


22140 


CCCCAAACTC 


CCATGGATCA 


CAACCCCACC 


ATGAACCTTA 


TTACCGGGGT 


ACCCAACTCC 


22200 


ATGCTCAACA 


GTCCCCAGGT 


ACAGCCCACC 


CTGCGTCGCA 


ACCAGGAACA 


GCTCTACAGC 


22260 


TTCCTGGAGC 


GCCACTCGCC 


CTACTTCCGC 


AGCCACAGTG 


CGCAGATTAG 


GAGCGCCACT 


22320 


TCTTTTTGTC 


ACTTGAAAAA 


CATGTAAAAA 


TAATGTACTA 


GAGACACTTT 


CAATAAAGGC 


22380 


AAATGCTTTT 


ATTTGTACAC 


TCTCGGGTGA 


TTATTTACCC 


CCACCCTTGC 


CGTCTGCGCC 


22440 


GTTTAAAAAT 


CAAAGGGGTT 


CTGCCGCGCA 


TCGCTATGCG 


CCACTGGCAG 


GGACACGTTG 


22500 


CGATACTGGT 


GTTTAGTGCT 


CCACTTAAAC 


TCAGGCACAA 


CCATCCGCGG 


CAGCTCGGTG 


22560 


AAGTTTTCAC 


TCCACAGGCT 


GCGCACCATC 


ACCAACGCGT 


TTAGCAGGTC 


GGGCGCCGAT 


22620 


ATCTTGAAGT 


CGCAGTTGGG 


GCCTCCGCCC 


TGCGCGCGCG 


AGTTGCGATA 


CACAGGGTTG 


22680 


CAGCACTGGA 


ACACTATCAG 


CGCCGGGTGG 


TGCACGCTGG 


CCAGCACGCT 


CTTGTCGGAG 


22740 


ATCAGATCCG 


CGTCCAGGTC 


CTCCGCGTTG 


CTCAGGGCGA 


ACGGAGTCAA 


CTTTGGTAGC 


22800 


TGCCTTCCCA 


AAAAGGGCGC 


GTGCCCAGGC 


TTTGAGTTGC 


ACTCGCACCG 


TAGTGGCATC 


22860 


AAAAGGTGAC 


CGTGCCCGGT 


CTGGGCGTTA 


GGATACAGCG 


CCTGCATAAA 


AGCCTTGATC 


22920 


TGCTTAAAAG 


CCACCTGAGC 


CTTTGCGCCT 


TCAGAGAAGA 


ACATGCCGCA 


AGACTTGCCG 


22980 


GAAAACTGAT 


TGGCCGGACA 


GGCCGCGTCG 


TGCACGCAGC 


ACCTTGCGTC 


GGTGTTGGAG 


23040 


ATCTGCACCA 


CATTTCGGCC 


CCACCGGTTC 


TTCACGATCT 


TGGCCTTGCT 


AGACTGCTCC 


23100 


TTCAGCGCGC 


GCTGCCCGTT 


TTCGCTCGTC 


ACATCCATTT 


CAATCACGTG 


CTCCTTATTT 


23160 


ATCATAATGC 


TTCCGTGTAG 


ACACTTAAGC 


TCGCCTTCGA 


TCTCAGCGCA 


GCGGTGCAGC 


23220 


CACAACGCGC 


AGCCCGTGGG 


CTCGTGATGC 


TTGTAGGTCA 


CCTCTGCAAA 


CGACTGCAGG 


23280 


TACGCCTGCA 


GGAATCGCCC 


CATCATCGTC 


ACAAAGGTCT 


TGTTGCTGGT 


GAAGGTCAGC 


23340 


TGCAACCCGC 


GGTGCTCCTC 


GTTCAGCCAG 


GTCTTGCATA 


CGGCCGCCAG 


AGCTTCCACT 


23400 


TGGTCAGGCA 


GTAGTTTGAA 


GTTCGCCTTT 


AGATCGTTAT 


CCACGTGGTA 


CTTGTCCATC 


23460 


AGCGCGCGCG 


CAGCCTCCAT 


GCCCTTCTCC 


CACGCAGACA 


CGATCGGCAC 


ACTCAGCGGG 


23520 


TTCATCACCG 


TAATTTCACT 


TTCCGCTTCG 


CTGGGCTCTT 


CCTCTTCCTC 


TTGCGTCCGC 


23580 
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•ggWc 



23880 
23940 
24 000 
24060 
24120 



WO 98/17783 

ATACCACGCG CCACTgWc GTCTTCATTC AGCCGCCGCA CTGTGCGCTT ACCTCCTTTG 2 364 0 
CCATGCTTGA TTAGCACCGG TGGGTTGCTG AAACCCACCA TTTGTAGCGC CACATCTTC^ 23700 
CTTTCTTCCT CGCTGTCCAC GATTACCTCT GGTGATGGCG GGCGCTCGGG CTTGGGAGAA 23760 
GGGCGCTTCT TTTTCTTCTT GGGCGCAATG GCCAAATCCG CCGCCGAGGT CGATGGCCGC 23820 
GGGCTGGGTG TGCGCGGCAC CAGCGCGTCT TGTGATGAGT CTTCCTCGTC CTCGGACTCG 
ATACGCCGCC TCATCCGCTT TTTTGGGGGC GCCCGGGGAG GCGGCGGCGA CGGGGACGGG 
GACGACACGT CCTCCATGGT TGGGGGACGT CGCGCCGCAC CGCGTCCGCG CTCGGGGGTG 
GTTTCGCGCT GCTCCTCTTC CCGACTGGCC ATTTCCTTCT CCTATAGGCA GAAAAAGATC 
ATGGAGTCAG TCGAGAAGAA GGACAGCCTA ACCGCCCCCT CTGAGTTCGC CACCACCGCC 

TCCACCGATG CCGCCAACGC GCCTACCACC TTCCCCGTCG AGGCACCCCC GCTTGAGGaI 24180 

,GAGGAAGTGA TTATCGAGGA GGACCCAGGT TTTGTAAGCG AAGACGACGA GGACCGCTCA 24240 

GTACCAACAG AGGATAAAAA GCAAGACCAG GACAACGCAG AGGCAAACGA GGAACAAGTC 24300 

GGGCGGGGGG ACGAAAGGCA TGGCGACTAC CTAGATGTGG GAGACGACGT GCTGTTGAAG 24360 

CATCTGCAGC GCCAGTGCGC CATTATCTGC GACGCGTTGC AAGAGCGCAG CGATGTGCCC 24420 

CTCGCCATAG CGGATGTCAG CCTTGCCTAC GAACGCCACC TATTCTCACC GCGCGTACCC 24480 

CCCAAACGCC AAGAAAACGG CACATGCGAG CCCAACCCGC GCCTCAACTT CTACCCCGTA 24540 

TTTGCCGTGC CAGAGGTGCT TGCCACCTAT CACATCTTTT TCCAAAACTG CAAGATACCC 24600 

CTATCCTGCC GTGCCAACCG CAGCCGAGCG GACAAGCAGC TGGCCTTGCG GCAGGGCGCT 24660 

GTCATACCTG ATATCGCCTC GCTCAACGAA GTGCCAAAAA TCTTTGAGGG TCTTGGACGC 24720 

GACGAGAAGC GCGCGGCAAA CGCTCTGCAA CAGGAAAACA GCGAAAATGA AAGTCACTCT 24780 

GGAGTGTTGG TGGAACTCGA GGGTGACAAC GCGCGCCTAG CCGTACTAAA ACGCAGCATC 24840 

GAGGTCACCC ACTTTGCCTA CCCGGCACTT AACCTACCCC CCAAGGTCAT GAGCACAGTC 24 900 

ATGAGTGAGC TGATCGTGCG CCGTGCGCAG CCCCTGGAGA GGGATGCAAA TTTGCAAGAA 24960 

CAAACAGAGG AGGGCCTACC CGCAGTTGGC GACGAGCAGC TAGCGCGCTG GCTTCAAACG 25020 

:„-CGCGAGCCTG CCGACTTGGA GGAGCGACGC AAACTAATGA TGGCCGCAGT GCTCGTTACC 25080 

-,:GTGGAGCTTG AGTGCATGCA GCGGTTCTTT GCTGACCCGG AGATGCAGCG CAAGCTAGAG 2S140 

GAAACATTGC ACTACACCTT TCGACAGGGC TACGTACGCC AGGCCTGCAA GATCTCCAAC 25200 

GTGGAGCTCT GCAACCTGGT CTCCTACCTT GGAATTTTGC ACGAAAACCG CCTTGGGCAA 25260 

.;,AACGTGCTTC ATTCCACGCT CAAGGGCGAG GCGCGCCGCG ACTACGTCCG CGACTGCGTT 25320 

-TACTTATTTC TATGCTACAC CTGGCAGACG GCCATGGGCG TTTGGCAGCA GTGCTTGGAG 25380 

...GAGTGCAACC TCAAGGAGCT GCAGAAACTG CTAAAGCAAA ACTTGAAGGA CCTATGGACG 25440 

GCCTTCAACG AGCGCTCCGT GGCCGCGCAC CTGGCGGACA TCATTTTCCC CGAACGCCTG 25500 

CTTAAAACCC TGCAACAGGG TCTGCCAGAC TTCACCAGTC AAAGCATGTT GCAGAACTTT 25560 

AGGAACTTTA TCCTAGAGCG CTCAGGAATC TTGCCCGCCA CCTGCTGTGC ACTTCCTAGC 25620 

GACTTTGTGC CCATTAAGTA CCGCGAATGC CCTCCGCCGC TTTGGGGCCA CTCCTACCTT 25680 
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CTGCAGCTAG CCAACT>^^ 

GGTCTACTGG AGTGTCACTG 

AATTCGCAGC TGCTTAACGA 

CCTGACGAAA AGTCGGCGGC 

TACCTTCGCA AATTTGTACC 

CAATCCCGCC CGCCAAATGC 

GGCCAATTGC AAGCCATCAA 

GTTTACTTGG ACCCCCAGTC 

TATCAGCAGC AGCCGCGGGC 

GCCGCCGCCA CCCACGGACG 

CGAGGAGGAG GAGGACATGA 

CGAAGAGGTG TCAGACGAAA 

GAAATCGGCA ACCGGTTCCA 

GCCCGTTCGC CGACCCAACC 

GCAGCCGCCG CCGTTAGCCC 

GCACAAGAAC GCCATAGTTG 

CCGCTTTCTT CTCTACCATC 

TCATCTCTAC AGCCCATACT 

CACAGAAGCA AAGGCGACCG 

CGGCAGCAGC AGGAGGAGGA 

AGCTTAGAAA CAGGATTTTT 

AACAAGAGCT GAAAATAAAA 

ACAAAAGCGA AGATCAGCTT 

ACTGCGCGCT GACTCTTAAG 

TACGTCATCT CCAGCGGCCA 

AGGAAATTCC CACGCCCTAC 

CTGCCCAAGA CTACTCAACC 

GGGTCAACGG AATCCGCGCC 

CCACACCTCG TAATAACCTT 

GTCCCGCTCC CACCACTGTG 

ACTCAGGGGC GCAGCTTGCG 

TAACTCACCT GACAATCAGA 

CGCTTGGTCT CCGTCCGGAC 

CGCCTCGTCA GGCAATCCTA 

TTGGAACTCT GCAATTTATT 



PCT/US97/19541 



TGCCTACCAC 


TCTGACATAA 


TGGAAGACGT 


GAGCGGTGAC 


25740 


TCGCTGCAAC 


CTATGCACCC 


CGCACCGCTC 


CCTGGTTTGC 


25800 


AAGTCAAATT 


ATCGGTACCT 


TTGAGCTGCA 


GGGTCCCTCG 


25860 


TCCGGGGTTG 


AAACTCACTC 


CGGGGCTGTG 


GACGTCGGCT 


25920 


TGAGGACTAC 


CACGCCCACG 


AGATTAGGTT 


CTACGAAGAC 


25980 


GGAGCTTACC 


GCCTGCGTCA 


TTACCCAGGG 


CCACATTCTT 


26040 


CAAAGCCCGC 


CAAGAGTTTC 


TGCTACGAAA 


GGGACGGGGG 


26100 


CGGCGAGGAG 


CTCAACCCAA 


TCCCCCCGCC 


GCCGCAGCCC 


26160 


CCTTGCTTCC 


CAGGATGGCA 


CCCAAAAAGA 


AGCTGCAGCT 


26220 


AGGAGGAATA 


CTGGGACAGT 


CAGGCAGAGG 


AGGTTTTGGA 


26280 


TGGAAGACTG 


GGAGAGCCTA 


GACGAGGAAG 


CTTCCGAGGT 


26340 


CACCGTCACC 


CTCGGTCGCA 


TTCCCCTCGC 


CGGCGCCCCA 


26400 


GCATGGCTAC 


AACCTCCGCT 


CCTCAGGCGC 


CGCCGGCACT 


26460 


GTAGATGGGA 


CACCACTGGA 


ACCAGGGCCG 


GTAAGTCCAA 


26520 


AAGAGCAACA 


ACAGCGCCAA 


GGCTACCGCT 


CATGGCGCGG 


26580 


CTTGCTTGCA 


AGACTGTGGG 


GGCAACATCT 


CCTTCGCCCG 


26640 


ACGGCGTGGC 


CTTCCCCCGT 


AACATCCTGC 


ATTACTACCG 


26700 


GCACCGGCGG 


CAGCGGCAGC 


GGCAGCAACA 


GCAGCGGCCA 


26760 


GATAGCAAGA 


CTCTGACAAA 


GCCCAAGAAA 


TCCACAGCGG 


26820 


GCGCTGCGTC 


TGGCGCCCAA 


CGAACCCGTA 


TCGACCCGCG 


26880 


CCCACTCTGT 


ATGCTATATT 


TCAACAGAGC 


AGGGGCCAAG 


26940 


AACAGGTCTC 


TGCGATCCCT 


CAGCGGCAGC 


TGCCTGTATC 


27000 


CGGCGCACGC 


TGGAAGACGC 


GGAGGCTCTC 


TTCAGTAAAT 


27060 


GACTAGTTTC 


GCGCCCTTTC 


TCAAATTTAA 


GCGCGAAAAC 


27120 


CACCCGGCGC 


CAGCACCTGT 


CGTCAGCGCC 


ATTATGAGCA 


27180 


ATGTGGAGTT 


ACCAGCCACA 


AATGGGACTT 


GCGGCTGGAG 


27240 


CGAATAAACT 


ACATGAGCGC 


GGGACCCCAC 


ATGATATCCC 


27300 


CACCGAAACC 


GAATTCTCTT 


GGAACAGGCG 


GCTATTACCA 


27360 


AATCCCCGTA 


GTTGGCCCGC 


TGCCCTGGTG 


TACCAGGAAA 


27420 


GTACTTCCCA 


GAGACGCCCA 


GGCCGAAGTT 


CAGATGACTA 


27480 


GGCGGCTTTC 


GTCACAGGGT 


GCGGTCGCCC 


GGGCAGGGTA 


27540 


GGGCGAGGTA 


TTCAGCTCAA 


CGACGAGTCG 


GTGAGCTCCT 


27600 


GGGACATTTC 


AGATCGGCGG 


CGCCGGCCGT 


CCTTCATTCA 


27660 


ACTCTGCAGA 


CCTCGTCCTC 


TGAGCCGCGC 


TCTGGAGGCA 


27720 


GAGGAGTTTG 


TGCCATCGGT 


CTACTTTAAC 


CCCTTCTCGG 


27780 
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GACCTCCC3G CCACT^^G GATCAATTTA TTCCTAACTT TGACGCGGTA AAGGACTCGG 
CGGACGGCTA CGACTGAATG TTAAGTGGAG AGGGAGAGCA ACTGCGCCTG ,:^CACCTGG 
TCCACTGTGG CCGCCACAAG TGCTTTGCCC GGGACTCCGG TGAGTTTTGC TACTTTGAAT 
TGCGCGAGGA TCATATCGAG GGCGGGGGGG ACGGCGTCCG GCTTACCGCC CAGGGAGAGC 
TTGCCCGTAG CCTGATTCGG GAGTTTACCC AGCGCCCCCT GCTAGTTGAG CGGGAGAGGG 
GACCCTGTGT TCTCACTGTG ATTTGCAACT GTCCTAACCT TGGATTACAT GAAGATCTTT 
GTTGCCA7CT CTGTGCTGAG TATAATAAAT ACAGAAATTA AAATATACTG GGGCTCCTAT 
CGCGATCCTG TAAACGCCAC CGTCTTCACC CGCCCAAGCA AAGC.AAGGCG AACGTTACCT 
GGTACTTTTA ACATCTCTCG CTCTGTGATT TACAACAGTT TCAACCCAGA CGGAGTGAGT 
CTAGGAGAGA ACCTCTCCGA GGTCAGCTAC TCCATCAGAA AAAAGACCAC CGTCCTTACC 
TGGCGGGAAC GTACGAGTGC GTCACCGGCC GCTGCACCAC ACCTACCGCC TGACCGTAAA 
CGAGACTTTT TCCGGACAGA CCTCAATAAC TCTGTTTACC AGAACAGGAG GTGAGCTTAG 
AAAACCCTTA GGGTATTAGG CCAAAGGCGC AGCTACTGTG GGGTTTATGA ACAATTCAAG 
GAACTCTACG GGCTATTCTA ATTCAGGTTT CTCTAGAATC GGGGTTGGGG TTATTCTCTG 
TCTTGTGATT CTCTTTATTC TTATACTAAC GCTTCTCTGC CTAAGGCTCG CCGCCTGCTG 
TGTGCACATT TGCATTTATT GTCAGCTTTT T.AAACGCTGG GGTCGCCAGC CAAGATGATT ' 
AGGTACA7.AA TCCTAGGTTT ACTCACCCTT GCGTCAGCCC ACGGTACCAC CCAAAAGGTG 
GATTTT.A.Z.GG AGCCAGCCTG TAATGTTACA TTCGCAGCTG AAGCTA.ATGA GTGCACCACT 
CTTATAA.AAT GCACCACAGA ACATGAAAAG CTGCTTATTC GCCACAAAAA CAAAATTGGC 
AAGTATGCTG TTTATGCTAT TTGGCAGCCA GGTGACACTA CAGAGTATAA TGTTACAGTT 
TTCCAGGGTA AAAGTCATAA AACTTTTATG TATACTTTTC CATTTTATGA AATGTGCGAC 
ATTACCATGT ACATGAGCAA ACAGTATAAG TTGTGGCCCC CACAAAATTG TGTGGAAAAC 
ACTGGGACTT TCTGCTGCAG TGGTATGCTA ATTACAGTGC TCGCTTTGGT CTGTACCCTA 
CTCTATATTA AATACAAAAG CAGACGCAGC TTTATTGAGG AAAAGAAAAT GCCTTAATTT 
:ACTAAG7TAG AAAGCTAATG TCACCACTAA GTGCTTTACT CGCTGCTTGC AAAACAAATT 
.CAAAAAGTTA GCATTATAAT TAGAATAGGA TTTAAACCCC CCGGTCATTT CCTGCTCAAT 
ACCATTCCCG TGAACAATTG ACTCTATGTG GGATATGCTC CAGCGCTACA ACCTTGAAGT 
CAGGCTTCCT GGATGTCAGC ATCTGACTTT GGCCAGCACC TGTCCCGCGG ATTTGTTCCA 
^GTCCAACTAC AGCGACCCAC CCTAACAGAG ATGACCAACA CAACCAACGC GGCCGCCGCT 
ACCGGACTTA CATCTACCAC AAATACACCC CAAGTTTCTG CCTTTGTCAA TAACTGGGAT 
,AACTTGGGCA TGTGGTGGTT CTCCATAGCG CTTATGTTTG TATGCCTTAT TATTATGTGG 
CTCATCTGCT GCCTAAAGCG CAAACGCGCC CGACCACCCA TCTATAGTGC CATCATTGTG 
CTACACCCAA ACAATGATGG AATCCATAGA TTGGACGGAC TGAAACACAT GTTCTTTTCT 
CTTACAGTAT GATTAAATGA GACATGATTC CTCGAGTTTT TATATTACTG ACCCTTGTTG 
CGCTTTTTTG TGCGTGCTCG ACATTGGCTG CGGTTTCTCA CATCGAAGTA GACTGCATTC 



27840 
:7900 
27960 
28020 
28080 
28140 
28200 
28260 
28320 
28380 
28440 
28500 
28560 
28620 
28680 
28740 
28800 
28860 
28920 
28980 
29040 
29100 
29160 
29220 
29280 
29340 
29400 
29460 
29520 
29580 
29640 
29700 
29760 
29820 
29880 
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CAGCCTTCAC 

TCACTGTGGT 
TCAGACACCA 
TATGAAATTT 
GACCTCCAAG 
TTGCTACAAT 
GGTGTTCTGC 
ACGAATAGAT 
AGTTGTTGCC 
TGAAATCAGC 
ACGGAATTAT 
GCATGAATCA 
GTCTGGTAAA 
ACAAGTTGCC 
TAACTCAGCA 
ATCTCTGCAC 
AAAAAAAAAT 
TTCAGCAGCA 
AACTTTCTCC 
ACTATCTTCA 
GTGTATCCAT 
GTATCCCCCA 
CCTCTAGTTA 
GAGGCCGGCA 
AAGTCAAACA 
GTGGCTGCCG 
CCGCTAACCG 
GAAGGAAAGC 
ACTATCACTG 
GAGCCCATTT 
ACAGACGACC 
TCCTTGCAAA 
AATGTAGCAG 
TATCCGTTTG 
AACTCAGCCC 



AGTCTATTTG 
CATCGCCTTT 
TCCCCAGTAC 
ACTGTGACTT 
CCTCAAAGAC 
GAAAAAAGCG 
AGTACCATCT 
GCCATGAACC 
GGCGGCTTTG 
TACTTTAATC 
TACAGAGCAG 
AGAGCTCCAA 
GCAGGCCAAA 
AACCAAGCGT 
CTCGGTAGAA 
CCTTATTAAG 
AATAAAGCAT 
CCTCCTTGCC 
ACAATCTAAA 
TGTTGTTGCA 
ATGACACGGA 
ATGGGTTTCA 
CCTCCAATGG 
ACCTTACCTC 
TAAACCTGGA 
CCGCACCTCT 
TGCACGACTC 
TAGCCCTGCA 
CCTCACCCCC 
ATACACAAAA 
TAAACACTTT 
CTAAAGTTAC 
GAGGACTAAG 
ATGCTCAAAA 
ACAACTTGGA 



CTTTACGGAT 
ATCCAGTGCA 
AGGGACAGGA 
TTCTGCTGAT 
ATATATCATG 
ATCTTTCCGA 
TAGCCCTAGC 
ACCCAACTTT 
TCCCAGCCAA 
TAACAGGAGG 
CGCCTGCTAG 
GACATGGTTA 
GTCACCTACG 
CAGAAATTGG 
ACCGAAGGCT 
ACCCTGTGCG 
CACTTACTTA 
CTCCTCCCAG 
TGGAATGTCA 
GATGAAGCGC 
AACCGGTCCT 
AGAGAGTCCC 
CATGCTTGCG 
CCAAAATGTA 
AATATCTGCA 
AATGGTCGCG 
CAAACTTAGC 
AACATCAGGC 
TCTAACTACT 
TGGAAAACTA 
GACCGTAGCA 
TGGAGCCTTG 
GATTGATTCT 
CCAACTAAAT 
TATTAACTAC 



TTGTCACCCT 
TTGACTGGGT 
CTATAGCTGA 
TATTTGCACC 
CAGATTCACT 
AGCCTGGTTA 
TATATATCCC 
CCCCGCGCCC 
TCAGCCTCGC 
AGATGACTGA 
7VAAGACGCAG 
ACTTGCACCA 
ACAGTAATAC 
TGGTCATGGT 
GCATTCACTC 
GTCTCAAAGA 
AAATCAGTTA 
CTCTGGTATT 
GTTTCCTCCT 
GCAAGACCGT 
CCAACTGTGC 
CCTGGGGTAC 
CTCAAAATGG 
ACCACTGTGA 
CCCCTCACAG 
GGCAACACAC 
ATTGCCACCC 
CCCCTCACCA 
GCCACTGGTA 
GGACTAAAGT 
ACTGGTCCAG 
GGTTTTGATT 
CAAAACAGAC 
CTAAGACTAG 
AACAAAGGCC 



CACGCTCATC 
CTGTGTGCGC 
GCTTCTTAGA 
CTATCTGCGT 
CGTATATGGA 
TATGCAATCA 
TACCTTGACA 
GCTATGCTTC 
CCCACTTCTC 
CACCCTAGAT 
GGCAGCGGCC 
GTGCAAAAGG 
CACCGGACAC 
GGGAGAAAAG 
ACCTTGTCAA 
TCTTATTCCC 
GCAAATTTCT 
GCAGCTTCCT 
GTTCCTGTCC 
CTGAAGATAC 
CTTTTCTTAC 
TCTCTTTGCG 
GCAACGGCCT 
GCCCACCTCT 
TTACCTCAGA 
TCACCATGCA 
AAGGACCCCT 
CCACCGATAG 
GCTTGGGCAT 
ACGGGGCTCC 
GTGTGACTAT 
CACAAGGCAA 
GCCTTATACT 
GACAGGGCCC 
TTTACTTGTT 



PCT/US97/ 19541 

TGCAGCCTCA 2 9 940 

TTTGCATATC 3 000 0 

ATTCTTTAAT 3 0 060 

TTTGTTCCCC 3 012 0 

ATATTCCAAG 3 018 0 

TCTCTGTTAT 3 0240 

TTGGCTGGAA 3 03 00 

CACTGCAACA 3 0 360 

CCACCCCCAC 3 0420 

CTAG AAATGG 3 04 8 0 

GAGCAACAGC 3 0 540 

GGTATCTTTT 3 0600 

CGCCTTAGCT 3 0660 

CCCATTACCA 3 0720 

GG ACCTGAGG 30 78 0 

TTTAACTAuAT 3 0840 

GTCCAGTTTA 3 0 900 

CCTGGCTGCA 3 0 960 

ATCCGCACCC 31020 

CTTCAACCCC 31080 

TCCTCCCTTT 3114 0 

CCTATCCGAA 31200 

CTCTCTGGAC 31260 

CAAAAAAACC 31320 

AGCCCTAACT 313 80 

ATCACAGGCC 31440 

CACAGTGTCA 31500 

C AGTACCCTT 3156 0 

TGACTTGAAA 316 2 0 

TTTGCATGTA 316 8 0 

TAATAATACT 31740 

TATGCAACTT 31800 

TGATGTTAGT 31860 

TCTTTTTATA 31920 

TACAGCTTCA 31980 
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AACAA.TCCA AAAAGC^R GGTTAACCTA AGCACTGCCA AGGGGTTGAT GTTTGACGCT 3 2 040 

ACAGCCATAG CCATTAATGC AGGAGATGGG CTTGAATTTG GTTCACCTAA TGCACCAAAC 3 2100 

ACAA.iTCCCC TCAAAACAAA AATTGGCCAT GGCCTAGAAT TTGATTCAAA CAAGGCTATG 3 216 0 

GTTCCTAAAC TAGGAACTGG CCTTAGTTTT GACAGCACAG GTGCCATTAC AGTAGGAAAC 3 2 220 

AAAAATAATG ATAAGCTAAC TTTGTGGACC ACACCAGCTC CATCTCCTAA CTGTAGACTA 3 2 280 

AATGCAGAGA AAGATGCTAA ACTCACTTTG GTCTTAACAA AATGTGGCAG TCAAATACTT 3234 0 

GCTACAGTTT CAGTTTTGGC TGTTAAAGGC AGTTTGGCTC CAATATCTGG AACAGTTCAA 3 2 400 

AGTGCTCATC TTATTATAAG ATTTGACGAA AATGGAGTGC TACTAAACAA TTCCTTCCTG 3246 0 

GACCCAGAAT ATTGGAACTT TAGAAATGGA GATCTTACTG AAGGCACAGC CTATACAAAC 3 2 520 

GCTGTTGGAT TTATGCCTAA CCTATCAGCT TATCCAAAAT CTCACGGTAA AACTGCCAAA 32 580 

AGTAACATTG TCAGTCAAGT TTACTTAAAC GGAGACAAAA CTAAACCTGT AACACTAACC 3264 0 

ATTACACTAA ACGGTACACA GGAAACAGGA GACACAACTC CAAGTGCATA CTCTATGTCA 32 700 

TTTTCATGGG ACTGGTCTGG CCACAACTAC ATTAATGAAA TATTTGCCAC ATCCTCTTAC 3 2 760 

ACTTTTTCAT ACATTGCCCA AGAATAAAGA ATCGTTTGTG TTATGTTTCA ACGTGTTTAT 3282 0 

TTTTCAATTG CAGAAAATTT CAAGTCATTT TTCATTCAGT AGTATAGCCC CACCACCACA 328 80 

TAGCTTATAC AGATCACCGT ACCTTAATCA AACTCACAGA ACCCTAGTAT TCAACCTGCC 3 2 940 

ACCTCCCTCC CAACACACAG AGTACACAGT CCTTTCTCCC CGGCTGGCCT TAAAAAGCAT 3 3 000 

CATATCATGG GTAACAGACA TATTCTTAGG TGTTATATTC CACACGGTTT CCTGTCGAGC 3 3 060 

CAAACGCTCA TCAGTGATAT TAATAAACTC CCCGGGCAGC TCACTTAAGT TCATGTCGCT 3 312 0 

GTCCAGCTGC TGAGCCACAG GCTGCTGTCC AACTTGCGGT TGCTTAACGG GCGGCGAAGG 3 3180 

AGAAGTCCAC GCCTACATGG GGGTAGAGTC ATAATCGTGC ATCAGGATAG GGCGGTGGTG 3 3 240 

CTGCAGCAGC GCGCGAATAA ACTGCTGCCG CCGCCGCTCC GTCCTGCAGG AATACAACAT 3 3 300 

GGCAGTGGTC TCCTCAGCGA TGATTCGCAC CGCCCGCAGC ATAAGGCGCC TTGTCCTCCG 3 3 360 

GGCACAGCAG CGCACCCTGA TCTCACTTAA ATCAGCACAG TAACTGCAGC ACAGCACCAC 3342 0 

AATATTGTTC AAAATCCCAC AGTGCAAGGC GCTGTATCCA AAGCTCATGG CGGGGACCAC 3 34 80 

AGAACCCACG TGGCCATCAT ACCACAAGCG CAGGTAGATT AAGTGGCGAC CCCTCATAAA 3 3 540 

CACGCTGGAC ATAAACATTA CCTCTTTTGG CATGTTGTAA TTCACCACCT CCCGGTACCA 3 3600 

TATAAACCTC TGATTAAACA TGGCGCCATC CACCACCATC CTAAACCAGC TGGCCAAAAC 3 3 660 

CTGCCCGCCG GCTATACACT GCAGGGAACC GGGACTGGAA CAATGACAGT GGAGAGCCCA 3 3 720 

GGACTCGTAA CCATGGATCA TCATGCTCGT CATGATATCA ATGTTGGCAC AACACAGGCA 3 3 7SO 

CACGTGCATA CACTTGCTCA GGATTACAAG CTCCTCCCGC GTTAGAACCA TATCCCAGGG 3 3 840 

AACAACCCAT TCCTGAATCA GCGTAAATCC CACACTGCAG GGAAGACCTC GCACGTAACT 3 3 900 

CACGTTGTGC ATTGTCAAAG TGTTACATTC GGGCAGCAGC GGATGATCCT CCAGTATGGT 3 3 960 

AGCGCGGGTT TCTGTCTCAA AAGGAGGTAG ACGATCCCTA CTGTACGGAG TGCGCCGAGA 34020 

CAACCGAGAT CGTGTTGGTC GTAGTGTCAT GCCAAATGGA ACGCCGGACG TAGTCATATT 3 4 080 
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TCCTGAAGCA AAACCAG^G CGGGCGTGAC AAACAGATCT 

TAGATCGCTC TGTGTAGTAG TTGTAGTATA TGCACTCTCT 

TGGCTTCGGG TTCTATGTAA ACTCCTTCAT GCGCCGCTGC 

CAGAATAAGC CACACCCAGC CAACCTACAC ATTCGTTCTG 

CGGGAAGAGC TGGAAGAACC ATGTTTTTTT TTTTATTCCA 

AAATGAAGAT CTATTAAGTG AACGGGCTGC CCTCCGGTGG 

AAAGAACAGA TAATGGCATT TGTAAGATGT TGCACAATGG 

CTCACGTCCA AGTGGACGTA AAGGCTAAAC CCTTCAGGGT 

CCAGCACCTT CAACCATGCC CAAATAATTC TCATCTCGCC 

AGCAAATCCC GAATATTAAG TCCGGCCATT GTAAAAATCT 

TTCAGCCTCA AGCAGCGAAT CATGATTGCA AAAATTCAGG 

GATTCAAAAG CGGAACATTA ACAAAAATAC CGCGATCCCG 

GCTGAACATA ATCGTGCAGG TCTGCACGGA CCAGCGCGGC 

TGACAAAAGA ACCCACACTG ATTATGACAC GCATACTCGG 

CCCCGATGTA AGCTTTGTTG CATGGGCGGC GATATAAAAT 

TCAGGC.^AAG CCTCGCGCAA AAAAGAAAGC ACATCGTAGT 

CAGGTAAGCT CCGGAACCAC CACAGAAAAA GACACCATTT 

GGTTTCTGCA TAAACACAAA ATAAAATAAC AAAAAAACAT 

TTACAACAGG AAAAACAACC CTTATAAGCA TAAGACGGAC 

CGTAAAAAAA CTGGTCACCG TGATTAAAAA GCACCACCGA 

GAGTCATAAT GTAAGACTCG GTAAACACAT CAGGTTGATT 

CGACCGAAAT AGCCCGGGGG AATACATACC CGCAGGCGTA 

ATAGGAGGTA TAACAAAATT AATAGGAGAG AAAAACACAT 

TGCCTAGGCA AAATAGCACC CTCCCGCTCC AGAACAACAT 

CCTAACAGTC AGCCTTACCA GTAAAAAAGA AAACCTATTA 

GCACCAGCTC AATCAGTCAC AGTGTAAAAA AGGGCCAAGT 

ACTAAAAAAT GACGTAACGG TTAAAGTCCA CAAAAAACAC 

CTACGCCCAG AAACGAAAGC CAAAAAACCC ACAACTTCCT 

CCCACGTTAC GTAACTTCCC ATTTTAAGAA AACTACAATT 

CCGCCCTAAA ACCTACGTCA CCCGCCCCGT TCCCACGCCC 

ACCCCCTCAT TATCATATTG GCTTCAATCC AAAATAAGGT 
(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
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GCQTCTCCGG TCTCGCCGCT 34 14 0 

CAAAGCATCC AGGCGCCCCC 34 2 00 

CCTGATAACA TCCACCACCG 34 2 60 

CGAGTCACAC ACGGGAGGAG 34 32 0 

AAAGATTATC CAAAACCTCA 34 3 80 

CGTGGTCAAA CTCTACAGCC 3444 0 

CTTCCAAAAG GCAAACGGCC 3 4 500 

GAATCTCCTC TATAAACATT 3 4 560 

ACCTTCTCAA TATATCTCTA 34 62 0 

GCTCCAGAGC GCCCTCCACC 34 680 

TTCCTCACAG ACCTGTATAA 3 4 740 

TAGGTCCCTT CGCAGGGCCA 34 8 00 

CACTTCCCCG CCAGGAACCT 34 86 0 

AGCTATGCTA ACCAGCGTAG 3 4 920 

GCAAGGTGCT GCTCAAAAAA 3 4 980 

CATGCTCATG CAGATAAAGG 3 5040 

TTCTCTCAAA CATGTCTGCG 3 5100 

TTAAACATTA GAAGCCTGTC 3 516 0 

TACGGCCATG CGGGCGTGAC 3 5 220 

CAGCTCCTCG GTCATGTCCG 3 5 280 

CATCGGTCAG TGCTAAAAAG 3 5 340 

GAGACAACAT TACAGCCCCC 3 5400 

AAACACCTGA AAAACCCTCC 3 5460 

ACAGCGCTTC ACAGCGGCAG 3 5 520 

AAAAAACACC ACTCGACACG 3 5580 

GCAGAGCGAG TATATATAGG 3 5640 

CCAGAAAACC GCACGCGAAC 3 5700 

CAAATCGTCA CTTCCGTTTT 3 5 760 

CCCAACACAT ACAAGTTACT 3 582 0 

CGCGCCACGT CACAAACTCC 3 5880 

ATATTATTGA TGATG 3 5 935 
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(11) MOLECULE^^FE: other nucleic acid 
(A) DESCRIPTION: /desc = "DNA" 

txi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 

TTCATTTTAT GTTTCAGGTT CAGGG 

(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHAJUVCTER I STI CS : 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(11) MOLECXn^E TYPE: other nucleic acid 
(A) DESCRIPTION: /desc ^ "DNA" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 

TTACCGCCAC ACTCGCAGGG 

(2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34303 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
, (D) TOPOLOGY: linear 

(11) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "DNA" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 

TTATTTTGGA TTGAAGCCAA TATGATAATG AGGGGGTGGA GTTTGTGACG TGGCGCGGGG 

CGTGGGAACG GGGCGGGTGA CGTAGTAGTG TGGCGGAAGT GTGATGTTGC AAGTGTGGCG 

GAACACATGT AAGCGACGGA TGTGGCAAAA GTGACGTTTT TGGTGTGCGC CGGATCCACA 

GGACGGGTGT GGTCGCCATG ATCGCGTAGT CGATAGTGGC TCCAAGTAGC GAAGCGAGCA 

GGACTGGGCG GCGGCCAAAG CGGTCGGACA GTGCTCCGAG AACGGGTGCG CATAGAAATT 

GCATCAACGC ATATAGCGCT AGCAGCACGC CATAGTGACT GGCGATGCTG TCGGAATGGA 

CGATATCCCG CAAGAGGCCC GGCAGTACCG GCATAACCAA GCCTATGCCT ACAGCATCCA 

GGGTGACGGT GCCGAGGATG ACGATGAGCG CATTGTTAGA TTTCATACAC GGTGCCTGAC 

TGCGTTAGCA ATTTAACTGT GATAAACTAC CGCATTAAAG CTTATCGATG ATAAGCTGTC 

AAACATGAGA ATTCTTGAAG ACGAAAGGGC CTCGTGATAC GCCTATTTTT ATAGGTTAAT 

GTCATGATAA TAATGGTTTC TTAGACGTCA GGTGGCACTT TTCGGGGAAA TGTGCGCGGA 

ACCCCTATTT GTTTATTTTT CTAAATACAT TCAAATATGT ATCCGCTCAT GAGACAATAA 

CCCTGATAAA TGCTTCAATA ATATTGAAAA AGGAAGAGTA TGAGTATTCA ACATTTCCGT 

GTCGCCCTTA TTCCCTTTTT TGCGGCATTT TGCCTTCCTG TTTTTGCTCA CCCAGAAACG 

CTGGTGAAAG TAAAAGATGC TGAAGATCAG TTGGGTGCAC GAGTGGGTTA CATCGAACTG 

GATCTCAACA GCGGTAAGAT CCTTGAGAGT TTTCGCCCCG AAGAACGTTT TCCAATGATG 

AGCACTTTTA AAGTTCTGCT ATGTGGCGCG GTATTATCCC GTGTTGACGC CGGGCAAGAG 

CAACTCGGTC GCCGCATACA CTATTCTCAG AATGACTTGG TTGAGTACTC ACCAGTCACA 
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GAAAAGC.-.TC TTACGGATGG CATGACAGTA AGAGAATTAT GCAGTGCTGC CATAACGATG 114 0 

AGTGATA.^CA CTGCGGCCAA CTTACTTCTG ACAACGATCG GAGGACCGAA GGAGCTAACC 12 00 

GCTTTTTTGC ACAACATGGG GGATCATGTA ACTCGCCTTG ATCGTTGGGA ACGGGAGCTG 126 0 

AATGAAGCCA TACGAAACGA CGAGCGTGAC ACCACGATGC CTGCAGCAAT GGGAACAACG 132 0 

TTGCGCA.^C TATTAACTGG CGAACTACTT ACTCTAGCTT CCCGGCAACA ATTAATAGAC 13 8 0 

TGGATGGAGG CGGATAAAGT TGCAGGACCA CTTCTGCGCT CGGGCCTTCC GGCTGGCTGG 14 4 0 

TTTATTGCTG ATAAATCTGG AGCCGGTGAG CGTGGGTCTC GCGGTATCAT TGCAGCACTG 15 00 

GGGCCAGATG GTAAGCCCTC GCGTATCGTA GTTATCTACA CGACGGGGAG TCAGGCAACT 156 0 

ATGGATGAAC GAAATAGACA GATCGCTGAG ATAGGTGCCT CACTGATTAA GCATTGGTAA 16 2 0 

CTGTCAGACC AAGTTTACTC ATATATACTT TAGATTGATT TAAAACTTCA TTTTTAATTT 16 8 0 

AAAAGGATCT AGGTGAAGAT CCTTTTTGAT AATCTCATGA CGAAAATCCC TTAACGTGAG 174 0 

TTTTCGTTCC ACTGAGCGTC AGACCCCGTA GAAAAGATCA AAGGATCTTC TTGAGATCCT 18 0 0 

TTTTTTC7GC GCGTAATCTG CTGCTTGCAA ACAAAAAAAC CACCGCTACC AGCGGTGGTT 18 6 0 

TGTTTGCCGG ATCAAGAGCT ACCAACTCTT TTTCCGAAGG TAACTGGCTT GAGCAGAGCG 192 0 

CAGATACCAA ATACTGTCCT TCTAGTGTAG CCGTAGTTAG GCCACCACTT CAAGAACTCT 198 0 

GTAGCACCGC CTACATACCT CGCTCTGCTA ATCCTGTTAC CAGTGGCTGC TGCCAGTGGC 204 0 

GATAAGTCGT GTCTTACCGG GTTGGACTCA AGACGATAGT TACCGGATAA GGCGCAGCGG 210 0 

TCGGGCTGAA CGGGGGGTTC GTGCACACAG CCCAGCTTGG AGCGAACGAC CTACACCGAA 216 0 

CTGAGATACC TACAGCGTGA GCATTGAGAA AGCGCCACGC TTCCCGAAGG GAGAAAGGCG 222 0 

GACAGGTATC CGGTAAGCGG CAGGGTCGGA ACAGGAGAGC GCACGAGGGA GCTTCCAGGG 228 0 

GGAAACGCCT GGTATCTTTA TAGTGCTGTC GGGTTTCGCC ACCTCTGACT TGAGCGTCGA 234 0 

TTTTTGTGAT GCTCGTCAGG GGGGCGGAGC CTATGGAAAA ACGCCAGCAA CGCGGCCTTT 2 4 00 

TTACGGTTCC TGGCCTTTTG CTGGCCTTTT GCTCACATGT TCTTTCCTGC GTTATCCCCT 246 0 

GATTCTGTGG ATAACCGTAT TACCGCCTTT GAGTGAGCTG ATACCGCTCG CCGCAGCCGA 2 52 0 

ACGACCGAGC GCAGCGAGTC AGTGAGCGAG GAAGCGGAAG AGCGCCTGAT GCGGTATTTT 2 58 0 

CTCCTTACGC ATCTGTGCGG TATTTCACAC CGCATATGGT GCACTCTCAG TACAATCTGC 2 64 0 

TCTGATGCCG CATAGTTAAG CCAGTATACA CTCCGCTATC GCTACGTGAC TGGGTCATGG 2 700 

CTGCGCCCCG ACACCCGCCA ACACCCGCTG ACGCGCCCTG ACGGGCTTGT CTGCTCCCGG 2 760 

CATCCGCTTA CAGACAAGCT GTGACCGTCT CCGGGAGCTG CATGTGTCAG AGGTTTTCAC 282 0 

CGTCATCACC GAAACGCGCG AGGCAGTCTA GACAATAGTA GTACGGATAG CTGTGACTCC 2 8 80 

GGTCCTTCTA ACACACCTCC TGAGATACAC CCGGTGGTCC CGCTGTGCCC CATTAAACCA 2 94 0 

GTTGCCGTGA GAGTTGGTGG GCGTCGCCAG GCTGTGGAAT GTATCGAGGA CTTGCTTAAC 3 000 

GAGCCTGGGC AACCTTTGGA CTTGAGCTGT AAACGCCCCA GGCCATAAGG TGTAAACCTG 3 06 0 

TGATTGCGTG TGTGGTTAAC GCCTTTGTTT GCTGAATGAG TTGATGTAAG TTTAATAAAG 312 0 

GGTGAGATAA TGTTTAACTT GCATGGCGTG TTAAATGGGG CGGGGCTTAA AGGGTATATA 3180 
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ATGCGCCGTG GGCT^^TCTT GGTTACATCT GACCTCATGG AGGCTTGGGA GTGTTTGGAA 3 24 0 
GATTTTTCTG CTGTGCGTAA CTTGCTGGAA CAGAGCTCTA ACAGTACCTC TTGGTTTTGG 
AGGTTTCTGT GGGGCTCATC CCAGGCAAAG TTAGTCTGCA GAATTAAGGA GGATTACAAG 
TGGGAATTTG AAGAGCTTTT GAAATCCTGT GGTGAGCTGT TTGATTCTTT GAATCTGGGT 

CACCAGGCGC TTTTCCAAGA GAAGGTCATC AAGACTTTGG ATTTTTCCAC ACCGGGGCGC 3 4 80 

GCTGCGGCTG CTGTTGCTTT TTTGAGTTTT ATAAAGGATA AATGGAGCGA AGAAACCCAT 3540 

CTGAGCGGGG GGTACCTGCT GGATTTTCTG GCCATGCATC TGTGGAGAGC GGTTGTGAGA 3 6 00 

CACAAGAATC GCCTGCTACT GTTGTCTTCC GTCCGCCCGG CGATAATACC GACGGAGGAG 3660 

CAGCAGCAGC AGCAGGAGGA AGCCAGGCGG CGGCGGCAGG AGCAGAGCCC ATGGAACCCG 3 720 

AGAGCCGGCC TGGACCCTCG GGAATGAATG TTGTACAGGT GGCTGAACTG TATCCAGAAC 3 780 

TGAGACGCAT TTTGACAATT ACAGAGGATG GGCAGGGGCT AAAGGGGGTA AAGAGGGAGC 3 84 0 

GGGGGGCTTG TGAGGCTACA GAGGAGGCTA GGAATCTAGC TTTTAGCTTA ATGACCAGAC 3 900 
ACCGTCCTGA GTGTATTACT TTTCAACAGA TCAAGGATAA TTGCGCTAAT GAGCTTGATC 
TGCTGGCGCA GAAGTATTCC ATAGAGCAGC TGACCACTTA CTGGCTGCAG CCAGGGGATG 
ATTTTGAGGA GGCTATTAGG GTATATGCAA AGGTGGCACT TAGGCCAGAT TGCAAGTACA 
AGATCAGCAA ACTTGTAAAT ATCAGGAATT GTTGCTACAT TTCTGGGAAC GGGGCCGAGG 
TGGAGATAGA TACGGAGGAT AGGGTGGCCT TTAGATGTAG CATGATAAAT ATGTGGCCGG 
GGGTGCTTGG CATGGACGGG GTGGTTATTA TGAATGTAAG GTTTACTGGC CCCAATTTTA 
GCGGTACGGT TTTCCTGGCC AATACCAACC TTATCCTACA CGGTGTAAGC TTCTATGGGT 
TTAACAATAC CTGTGTGGAA GCCTGGACCG ATGTAAGGGT TCGGGGCTGT GCCTTTTACT 

GCTGCTGGAA GGGGGTGGTG TGTCGCCCCA AAAGCAGGGC TTCAATTAAG AAATGCCTCT 44 4 0 

TTGAAAGGTG TACCTTGGGT ATCCTGTCTG AGGGTAACTC CAGGGTGCGC CACAATGTGG 4500 

CCTCCGACTG TGGTTGCTTC ATGCTAGTGA AAAGCGTGGC TGTGATTAAG CATAACATGG 4 56 0 

TATGTGGC^A CTGCGAGGAC AGGGCCTCTC AGATGCTGAC CTGCTCGGAC GGCAACTGTC 462 0 

ACCTGCTGAA GACCATTCAC GTAGCCAGCC ACTCTCGCAA GGCCTGGCCA GTGTTTGAGC 46 8 0 

ATAACATACT GACCCGCTGT TCCTTGCATT TGGGTAACAG GAGGGGGGTG TTCCTACCTT 4 74 0 

ACCAATGCAA TTTGAGTCAC ACTAAGATAT TGCTTGAGCC CGAGAGCATG TCCAAGGTGA 4 800 

ACCTGAACGG GGTGTTTGAC ATGACCATGA AGATCTGGAA GGTGCTGAGG TACGATGAGA 4860 

CCCGCACCAG GTGCAGACCC TGCGAGTGTG GCGGTAAACA TATTAGGAAC CAGCCTGTGA 4 920 

TGCTGGATGT GACGGAGGAG CTGAGGCCCG ATCACTTGGT GCTGGCCTGC ACCCGCGCTG 4 980 

AGTTTGGCTC TAGCGATGAA GATACAGATT GAGGTACTGA AATGTGTGGG CGTGGCTTAA 504 0 

GGGTGGGAAA GAATATATAA GGTGGGGGTC TTATGTAGTT TTGTATCTGT TTTGCAGCAG 5100 

CCGCCGCCGC CATGAGCACC AACTCGTTTG ATGGAAGCAT TGTGAGCTCA TATTTGACAA 516 0 

CGCGCATGCC CCCATGGGCC GGGGTGCGTC AGAATGTGAT GGGCTCCAGC ATTGATGGTC 522 0 
GCCCCGTCCT GCCCGCAAAC TCTACTACCT TGACCTACGA GACCGTGTCT GGAACGCCGT 
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TGGAGACTGC AGCCTCCGCC GCCGCTTCAG CCGCTGCAGC CACCGCCCGC GGGATTGTGA 5340 

CTGACTTTGC TTTCCTGAGC CCGCTTGCAA GCAGTGCAGC TTCCCGTTCA TCCGCCCGCG 5400 

ATGACAAGTT GACGGCTCTT TTGGCACAAT TGGATTCTTT GACCCGGGAA CTTAATGTCG 54 6 0 

TTTCTCAGCA GCTGTTGGAT CTGCGCCAGC AGGTTTCTGC CCTGAAGGCT TCCTCCCCTC 5 52 0 

CCAATGCGGT TTAAAACATA AATAAAAAAC CAGACTCTGT TTGGATTTGG ATCAAGCAAG 5 58 0 

TGTCTTGCTG TCTTTATTTA GGGGTTTTGC GCGCGCGGTA GGCCCGGGAC CAGCGGTCTC 5 64 0 

GGTCGTTGAG GGTCCTGTGT ATTTTTTCCA GGACGTGGTA AAGGTGACTC TGGATGTTCA 5 70 0 

GATACATGGG CATAAGCCCG TCTCTGGGGT GGAGGTAGCA CCACTGCAGA GCTTCATGCT 5 76 0 

GCGGGGTGGT GTTGTAGATG ATCCAGTCGT AGCAGGAGCG CTGGGCGTGG TGCCTAAAAA 582 0 

TGTCTTTCAG TAGCAAGCTG ATTGCCAGGG GCAGGCCCTT GGTGTAAGTG TTTACAAAGC 5880 

GGTTAAGCTG GGATGGGTGC ATACGTGGGG ATATGAGATG CATCTTGGAC TGTATTTTTA 5 94 0 

GGTTGGCTAT GTTCCCAGCC ATATCCCTCC GGGGATTCAT GTTGTGCAGA ACCACCAGCA 6 000 

CAGTGTATCC GGTGCACTTG GGAAATTTGT CATGTAGCTT AGAAGGAAAT GCGTGGAAGA 6 06 0 

ACTTGGAGAC GCCCTTGTGA CCTCCAAGAT TTTCCATGCA TTCGTCCATA ATGATGGCAA 612 0 

TGGGCCCACG GGCGGCGGCC TGGGCGAAGA TATTTCTGGG ATCACTAACG TCATAGTTGT 618 0 

GTTCCAGGAT GAGATCGTCA TAGGCCATTT TTACAAAGCG CGGGCGGAGG GTGCCAGACT 6 24 0 

GCGGTATAAT GGTTCCATCC GGCCCAGGGG CGTAGTTACC CTCACAGATT TGCATTTCCC 6 3 00 

ACGCTTTGAG TTCAGATGGG GGGATCATGT CTACCTGCGG GGCGATGAAG AAAACGGTTT 6 36 0 

CCGGGGTAGG GGAGATCAGC TGGGAAGAAA GCAGGTTCCT GAGCAGCTGC GACTTACCGC 6 420 

AGCCGGTGGG CCCGTAAATC ACACCTATTA CCGGGTGCAA CTGGTAGTTA AGAGAGCTGC 64 8 0 

AGCTGCCGTC ATCCCTGAGC AGGGGGGCCA CTTCGTTAAG CATGTCCCTG ACTCGCATGT 6 54 0 

TTTCCCTGAC CAAATCCGCC AGAAGGCGCT CGCCGCCCAG CGATAGCAGT TCTTGCAAGG 6 6 00 

AAGCAAAGTT TTTCAACGGT TTGAGACCGT CCGCCGTAGG CATGCTTTTG AGCGTTTGAC 6 66 0 

CAAGCAGTTC CAGGCGGTCC CACAGCTCGG TCACCTGCTC TACGGCATCT CGATCCAGCA 6 72 0 

TATCTCCTCG TTTCGCGGGT TGGGGCGGCT TTCGCTGTAC GGCAGTAGTC GGTGCTCGTC 6 78 0 

CAGACGGGCC AGGGTCATGT CTTTCCACGG GCGCAGGGTC CTCGTCAGCG TAGTCTGGGT 6 84 0 

CACGGTGAAG GGGTGCGCTC CGGGCTGCGC GCTGGCCAGG GTGCGCTTGA GGCTGGTCCT 6 900 

GCTGGTGCTG AAGCGCTGCC GGTCTTCGCC CTGCGCGTCG GCCAGGTAGC ATTTGACCAT 6 96 0 

GGTGTCATAG TCCAGCCCCT CCGCGGCGTG GCCCTTGGCG CGCAGCTTGC CCTTGGAGGA 7 02 0 

GGCGCCGCAC GAGGGGCAGT GCAGACTTTT GAGGGCGTAG AGCTTGGGCG CGAGAAATAC 7 080 

CGATTCCGGG GAGTAGGCAT CCGCGCCGCA GGCCCCGCAG ACGGTCTCGC ATTCCACGAG 714 0 

CCAGGTGAGC TCTGGCCGTT CGGGGTCAAA AACCAGGTTT CCCCCATGCT TTTTGATGCG 7 2 00 

TTTCTTACCT CTGGTTTCCA TGAGCCGGTG TCCACGCTCG GTGACGAAAA GGCTGTCCGT 7 26 0 

GTCCCCGTAT ACAGACTTGA GAGGCCTGTC CTCGAGCGGT GTTCCGCGGT CCTCCTCGTA 7 320 

TAGAAACTCG GACCACTCTG AGACAAAGGC TCGCGTCCAG GCCAGCACGA AGGAGGCTAA 7380 
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GTGGGAGGGG 


TAGCGGTrCT 


TGTCCACTAG 


GGGGTCCACT 


CGCTCCAGGG 


TGTGAAGACA 


7440 


CATGTCGCCC 


TCTTCGGCAT 


CAAGGAAGGT 


GATTGGTTTG 


TAGGTGTAGG 


CCACGTGACC 


7500 


GGGTGTTCCT 


GAAGGGGGGC 


TATAAAAGGG 


GGTGGGGGCG 


CGTTCGTCCT 


CACTCTCTTC 


7560 


CGCATCGCTG 


TCTGCGAGGG 


CCAGCTGTTG 


GGGTGAGTAC 


TCCCTCTGAA 


AAGCGGGCAT 


7620 


GACTTCTGCG 


CTAAGATTGT 


CAGTTTCCAA 


AAACGAGGAG 


GATTTGATAT 


TCACCTGGCC 


7680 


CGCGGTGATG 


CCTTTGAGGG 


TGGCCGCATC 


CATCTGGTCA 


GAAAAGACAA 


TCTTTTTGTT 


7740 


GTCAAGCTTG 


GTGGCAAACG 


ACCCGTAGAG 


GGCGTTGGAC 


AGCAACTTGG 


CGATGGAGCG 


7800 


CAGGGTTTGG 


TTTTTGTCGC 


GATCGGCGCG 


CTCCTTGGCC 


GCGATGTTTA 


GCTGCACGTA 


7860 


TTCGCGCGCA 


ACGCACCGCC 


ATTCGGGAAA 


GACGGTGGTG 


CGCTCGTCGG 


GCACCAGGTG 


7920 


CACGCGCCAA 


CCGCGGTTGT 


GCAGGGTGAC 


AAGGTCAACG 


CTGGTGGCTA 


CCTCTCCGCG 


7980 


TAGGCGCTCG 


TTGGTCCAGC 


AGAGGCGGCC 


GCCCTTGCGC 


GAGCAGAATG 


GCGGTAGGGG 


8040 


GTCTAGCTGC 


GTCTCGTCCG 


GGGGGTCTGC 


GTCCACGGTA 


AAGACCCCGG 


GCAGCAGGCG 


8100 


CGCGTCGAAG 


TAGTCTATCT 


TGCATCCTTG 


CAAGTCTAGC 


GCCTGCTGCC 


ATGCGCGGGC 


8160 


GGCAAGCGCG 


CGCTCGTATG 


GGTTGAGTGG 


GGGACCCCAT 


GGCATGGGGT 


GGGTGAGCGC 


8220 


GGAGGCGTAC 


ATGCCGCAAA 


TGTCGTAAAC 


GTAGAGGGGC 


TCTCTGAGTA 


TTCCAAGATA 


8280 


TGTAGGGTAG 


CATCTTCCAC 


CGCGGATGCT 


GGCGCGCACG 


TAATCGTATA 


GTTCGTGCGA 


8340 


GGGAGCGAGG 


AGGTCGGGAC 


CGAGG TTGCT 


ACGGGCGGGC 


TGCTCTGCTC 


GGAAGACTAT 


8400 


CTGCCTGAAG 


ATGGCATGTG 


AGTTGGATGA 


TATGGTTGGA 


CGCTGGAAGA 


CGTTGAAGCT 


8460 


GGCGTCTGTG 


AGACCTACCG 


CGTCACGCAC 


GAAGGAGGCG 


TAGGAGTCGC 


GCAGCTTGTT 


8520 


GACCAGCTCG 


GCGGTGACCT 


GCACGTCTAG 


GGCGCAGTAG 


TCCAGGGTTT 


CCTTGATGAT 


8580 



GTCATACTTA TCCTGTCCCT TTTTTTTCCA CAGCTCGCGG TTGAGGACAA ACTCTTCGCG 
GTCTTTCCAG TACTCTTGGA TCGGAAACCC GTCGGCCTCC GAACGGTAAG AGCCTAGCAT 
GTAGAACTGG TTGACGGCCT GGTAGGCGCA GCATCCCTTT TCTACGGGTA GCGCGTATGC 
CTGCGCGGCC TTCCGGAGCG AGGTGTGGGT GAGCGCAAAG GTGTCCCTGA CCATGACTTT 
GAGGTACTGG TATTTGAAGT CAGTGTCGTC GCATCCGCCC TGCTCCCAGA GCAAAAAGTC 
CGTGCGCTTT TTGGAACGCG GATTTGGCAG GGCGAAGGTG ACATCGTTGA AGAGTATCTT 
TCCCGCGCGA GGCATAAAGT TGCGTGTGAT GCGGAAGGGT CCCGGCACCT CGGAACGGTT 
GTTAATTACC TGGGCGGCGA GCACGATCTC GTCAAAGCCG TTGATGTTGT GGCCCACAAT 
GTAAAGTTCC AAGAAGCGCG GGATGCCCTT GATGGAAGGC AATTTTTTAA GTTCCTCGTA 
GGTGAGCTCT TCAGGGGAGC TGAGCCCGTG CTCTGAAAGG GCCCAGTCTG CAAGATGAGG 
GTTGGAAGCG ACGAATGAGC TCCACAGGTC ACGGGCCATT AGCATTTGCA GGTGGTCGCG 
AAAGGTCCTA AACTGGCGAC CTATGGCCAT TTTTTCTGGG GTGATGCAGT AGAAGGTAAG 
CGGGTCTTGT TCCCAGCGGT CCCATCCAAG GTTCGCGGCT AGGTCTCGCG CGGCAGTCAC 
TAGAGGCTCA TCTCCGCCGA ACTTCATGAC CAGCATGAAG GGCACGAGCT GCTTCCCAAA 
GGCCCCCATC CAAGTATAGG TCTCTACATC GTAGGTGACA AAGAGACGCT CGGTGCGAGG 
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8700 

8760 
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8 88 0 

8940 
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<^Br ACTGGATCTC CCGCCACCAA TTGGAGGA^T C 



ATGCGAGCCG 


ATCGGGAACT 


ACTGGATCTC 


CCGCCACCAA 


TTGGAGGACT^ 


GGCTATTGAT 


9540 


GTGGTGAAAG 


TAGAAGTCCC 


TGCGACGGGC 


CGAACACTCG 


TGCTGGCTTT 


TGTAAAAACG 


•9600 


TGCGCAGTAC 


TGGCAGCGGT 


GCACGGGCTG 


TACATCCTGC 


ACGAGGTTGA 


CCTGACGACC 


9660 


GCGCACAAGG 


AAGCAGAGTG 


GGAATTTGAG 


CCCCTCGCCT 


GGCGGGTTTG 


GCTGGTGGTC 


9720 


TTCTACTTCG 


GCTGCTTGTC 


CTTGACCGTC 


TGGCTGCTCG 


AGGGGAGTTA 


CGGTGGATCG 


9780 


GACCACCACG 


CCGCGCGAGC 


CCAAAGTCCA 


GATGTCCGCG 


CGCGGCGGTC 


GGAGCTTGAT 


9840 


GACAACATCG 


CGCAGATGGG 


AGCTGTCCAT 


GGTCTGGAGC 


TCCCGCGGCG 


TCAGGTCAGG 


9900 


CGGGAGCTCC 


TGCAGGTTTA 


CCTCGCATAG 


ACGGGTCAGG 


GCGCGGGCTA 


GATCCAGGTG 


9960 


ATACGTAATT 


TCCAGGGGCT 


GGTTGGTGGC 


GGCGTCGATG 


GCTTGCAAGA 


GGCCGCATCC 


10020 


CCGCGGCGCG 


ACTACGGTAC 


CGCGCGGCGG 


GCGGTGGGCC 


GCGGGGGTGT 


CCTTGGATGA 


10080 


TGCATCTAAA 


AGCGGTGACG 


CGGGCGAGCC 


CCCGGAGGTA 


GGGGGGGCTC 


CGGACCCGCC 


10140 


GGGAGAGGGG 


GCAGGGGCAC 


GTCGGCGCCG 


CGCGCGGGCA 


GGAGCTGGTG 


CTGCGCGCGT 


10200 


AGGTTGCTGG 


CGAACGCGAC 


GACGCGGCGG 


TTGATCTCCT 


GAATCTGGCG 


CCTCTGCGTG 


10260 


AAGACGACGG 


GCCCGGTGAG 


CTTGAGCCTG 


AAAGAGAGTT 


CGACAGAATC 


AATTTCGGTG 


10320 


TCGTTGACGG 


CGGCCTGGCG 


CAAAATCTCC 


TGCACGTCTC 


CTGAGTTGTC 


TTGATAGGCG 


10380 


ATCTCGGCCA 


TGAACTGCTC 


GATCTCTTCC 


TCCTGGAGAT 


CTCCGCGTCC 


GGCTCGCTCC 


10440 


ACGGTGGCGG 


CGAGGTCGTT 


GGAAATGCGG 


GCCATGAGCT 


GCGAGAAGGC 


GTTGAGGCCT 


10500 


CCCTCGTTCC 


AGACGCGGCT 


GTAGACCACG 


CCCCCTTCGG 


CATCGCGGGC 


GCGCATGACC 


10560 


ACCTGCGCGA 


GATTGAGCTC 


CACGTGCCGG 


GCGAAGACGG 


CGTAGTTTCG 


CAGGCGCTGA 


10620 


AAGAGGTAGT 


TGAGGGTGGT 


GGCGGTGTGT 


TCTGCCACGA 


AGAAGTACAT 


AACCCAGCGT 


10680 


CGCAACGTGG 


ATTCGTTGAT 


ATCCCCCAAG 


GCCTCAAGGC 


GCTCCATGGC 


CTCGTAGAAG 


10740 


TCCACGGCGA 


AGTTGAAAAA 


CTGGGAGTTG 


CGCGCCGACA 


CGGTTAACTC 


CTCCTCCAGA 


10800 


AGACGGATGA 


GCTCGGCGAC 


AGTGTCGCGC 


ACCTCGCGCT 


CAAAGGCTAC 


AGGGGCCTCT 


10860 


TCTTCTTCTT 


CAATCTCCTC 


TTCCATAAGG 


GCCTCCCCTT 


CTTCTTCTTC 


TGGCGGCGGT 


10920 


GGGGGAGGGG 


GGACACGGCG 


GCGACGACGG 


CGCACCGGGA 


GGCGGTCGAC 


AAAGCGCTCG 


10980 


ATCATCTCCC 


CGCGGCGACG 


GCGCATGGTC 


TCGGTGACGG 


CGCGGCCGTT 


CTCGCGGGGG 


11040 


CGCAGTTGGA 


AGACGCCGCC 


CGTCATGTCC 


CGGTTATGGG 


TTGGCGGGGG 


GCTGCCATGC 


11100 


GGCAGGGATA 


CGGCGCTAAC 


GATGCATCTC 


AACAATTGTT 


GTGTAGGTAC 


TCCGCCGCCG 


11160 


AGGGACCTGA 


GCGAGTCCGC 


ATCGACCGGA 


TCGGAAAACC 


TCTCGAGAAA 


GGCGTCTAAC 


11220 


CAGTCACAGT 


CGCAAGGTAG 


GCTGAGCACC 


GTGGCGGGCG 


GCAGCGGGCG 


GCGGTCGGGG 


11280 


TTGTTTCTGG 


CGGAGGTGCT 


GCTGATGATG 


TAATTAAAGT 


AGGCGGTCTT 


GAGACGGCGG 


11340 


ATGGTCGACA 


GAAGCACCAT 


GTCCTTGGGT 


CCGGCCTGCT 


GAATGCGCAG 


GCGGTCGGCC 


11400 


ATGCCCCAGG 


CTTCGTTTTG 


ACATCGGCGC 


AGGTCTTTGT 


AGTAGTCTTG 


CATGAGCCTT 


11460 


TCTACCGGCA 


CTTCTTCTTC 


TCCTTCCTCT 


TGTCCTGCAT 


CTCTTGCATC 


TATCGCTGCG 


11520 


GCGGCGGCGG 


AGTTTGGCCG 


TAGGTGGCGC 


CCTCTTCCTC 


CCATGCGTGT 


GACCCCGAAG 


11580 



11640 
11700 
11760 
11820 
11880 
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CCCCTCATCG GCTGAA^G GGCTAGGTCG GCGACAACGC GCTCGGCTAA TATGGCCTGC 
TGCACCTGCG TGAGGGTAGA CTGGAAGTCA TCCATGTCCA CAAAGCGGTG GTATGCGCCC 
GTGTTGATGG TGTAAGTGCA GTTGGCCATA ACGGACCAGT TAACGGTCTG GTGACCCGGC 
TGCGAGAGCT CGGTGTACCT GAGACGCGAG TAAGCCCTCG AGTCAAATAC GTAGTCGTTG 
CAAGTCCGCA CCAGGTACTG GTATCCCACC AAAAAGTGCG GCGGCGGCTG GCGGTAGAGG 
GGCCAGCGTA GGGTGGCCGG GGCTCCGGGG GCGAGATCTT CCAACATAAG GCGATGATAT X1940 
CCGTAGATGT ACCTGGACAT CCAGGTGATG CCGGCGGCGG TGGTGGAGGC GCGCGGAAAG 12000 
TCGCGGACGC GGTTCCAGAT GTTGCGCAGC GGCAAAAAGT GCTCCATGGT CGGGACGCTC 12060 
TGGCCGGTCA GGCGCGCGCA ATCGTTGACG CTCTACCGTG CAAAAGGAGA GCCTGTAAGC 12120 
GGGCACTCTT CCGTGGTCTG GTGGATAAAT TCGCAAGGGT ATCATGGCGG ACGACCGGGG 12180 
TTCGAGCCCC GTATCCGGCC GTCCGCCGTG ATCCATGCGG TTACCGCCCG CGTGTCGAAC 12240 
CCAGGTGTGC GACGTCAGAC AACGGGGGAG TGCTCCTTTT GGCTTCCTTC CAGGCGCGGC 12300 
GGCTGCTGCG CTAGCTTTTT TGGCCACTGG CCGCGCGCAG CGTAAGCGGT TAGGCTGGAA 12360 
AGCGA.AAGCA TTAAGTGGCT CGCTCCCTGT AGCCGGAGGG TTATTTTCCA AGGGTTGAGT 12420 

CGCGGGACCC CCGGTTCGAG TCTCGGACCG GCCGGACTGC GGCGAACGGG GGTTTGCCTC 12480 

CCCGTCATGC AAGACCCCGC TTGCAAATTC CTCCGGAAAC AGGGACGAGC CCCTTTTTTG 12540 

CTTTTCCCAG ATGCATCCGG TGCTGCGGCA GATGCGCCCC CCTCCTCAGC AGCGGCAAGA 12600 

GCAAGAGCAG CGGCAGACAT GCAGGGCACC CTCCCCTCCT CCTACCGCGT CAGGAGGGGC 12660 

GACATCCGCG GTTGACGCGG CAGCAGATGG TGATTACGAA CCCCCGCGGC GCCGGGCCCG 12720 

GCACTACCTG GACTTGGAGG AGGGCGAGGG CCTGGCGCGG CTAGGAGCGC CCTCTCCTGA 12780 

GCGGTACCCA AGGGTGCAGC TGAAGCGTGA TACGCGTGAG GCGTACGTGC CGCGGCAGAA 12840 

CCTGTTTCGC GACCGCGAGG GAGAGGAGCC CGAGGAGATG CGGGATCGAA AGTTCCACGC 12900 

AGGGCGCGAG CTGCGGCATG GCCTGAATCG CGAGCGGTTG CTGCGCGAGG AGGACTTTGA 12960 

GCCCGACGCG CGAACCGGGA TTAGTCCCGC GCGCGCACAC GTGGCGGCCG CCGACCTGGT 13020 

AACCGCATAC GAGCAGACGG TGAACCAGGA GATTAACTTT CAAAAAAGCT TTAACAACCA 13080 

CGTGCGTACG CTTGTGGCGC GCGAGGAGGT GGCTATAGGA CTGATGCATC TGTGGGACTT 1314 0 

TGTAAGCGCG CTGGAGCAAA ACCCAAATAG CAAGCCGCTC ATGGCGCAGC TGTTCCTTAT 13200 

AGTGCAGCAC AGCAGGGACA ACGAGGCATT CAGGGATGCG CTGCTAAACA TAGTAGAGCC 13260 

CGAGGGCCGC TGGCTGCTCG ATTTGATAAA CATCCTGCAG AGCATAGTGG TGCAGGAGCG 13320 

CAGCTTGAGC CTGGCTGACA AGGTGGCCGC CATCAACTAT TCCATGCTTA GCCTGGGCAA 13380 

GTTTTACGCC CGCAAGATAT ACCATACCCC TTACGTTCCC ATAGACAAGG AGGTAAAGAT 13440 

CGAGGGGTTC TACATGCGCA TGGCGCTGAA GGTGCTTACC TTGAGCGACG ACCTGGGCGT 13500 

TTATCGCAAC GAGCGCATCC ACAAGGCCGT GAGCGTGAGC CGGCGGCGCG AGCTCAGCGA 13 560 

CCGCGAGCTG ATGCACAGCC TGCAAAGGGC CCTGGCTGGC ACGGGCAGCG GCGATAGAGA 13620 

GGCCGAGTCC TACTTTGACG CGGGCGCTGA CCTGCGCTGG GCCCCAAGCC GACGCGCCCT 13680 
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GGAGGCAGCT 

CGGCGTGGAG 
GGTGATGTTT 
CAGAGCCAGC 
ATGTCGCTGA 
TCCGCAATTC 
GCGATCGTAA 
TACGACGCGC 
GACCGGCTGG 
GGCAACCTGG 
CCGCGGGGAC 
ACACCGCAAA 
GGCCTGCAGA 
CGGGCTCCCA 
TTGCTGCTGC 
GGTCACTTGC 
TTCCAGGAGA 
GCAACCCTAA 
AACAGCGAGG 
CGCGACGGGG 
ATGTATGCCT 
GCCGTGAACC 
GGTTTCTACA 
ATAGACGACA 
CAGGCAGAGG 
GGCGCTGCGG 
ACCAGCACTC 
CTGCTGCAGC 
AGCCTAGTGG 
GGCCCGCGCC 
GACGATGACT 
GCGCACCTTC 
AAAAAACTCA 
GGCGCGCGGC 
CGCCAGTGGC 



GGGGCCGGAC 
GAATATGACG 
CTGATCAGAT 
CGTCCGGCCT 
CTGCGCGCAA 
TGGAAGCGGT 
ACGCGCTGGC 
TGCTTCAGCG 
TGGGGGATGT 
GCTCCATGGT 
AGGAGGACTA 
GTGAGGTGTA 
CCGTAAACCT 
CAGGCGACCG 
TAATAGCGCC 
TGACACTGTA 
TTACAAGTGT 
ACTACCTGCT 
AGGAGCGCAT 
TAACGCCCAG 
CAAACCGGCC 
CCGAGTATTT 
CCGGGGGATT 
GCGTGTTTTC 
CGGCGCTGCG 
CCCCGCGGTC 
GCACCACCCG 
CGCAGCGCGA 
ACAAGATGAG 
CGCCCACCCG 
CGGCAGACGA 
GCCCCAGGCT 
CCAAGGCCAT 
GATGTATGAG 
GGCGGCGCTG 



CTGGGCTGGC 
AGGACGATGA 
GATGCAAGAC 
TAACTCCACG 
TCCTGACGCG 
GGTCCCGGCG 
CGAAAACAGG 
CGTGGCTCGT 
GCGCGAGGCC 
TGCACTAAAC 
CACCAACTTT 
CCAGTCTGGG 
GAGCCAGGCT 
CGCGACCGTG 
CTTCACGGAC 
GCGCGAGGCC 
CAGCCGCGCG 
GACCAACCGG 
TTTGCGCTAC 
CGTGGCGCTG 
GTTTATCAAC 
CACCAATGCC 
CGAGGTGCCC 
CCCGCAACCG 
AAAGGAAAGC 
AGATGCTAGT 
CCCGCGCCTG 
AAAAAACCTG 
TAGATGGAAG 
TCGTCAAAGG 
CAGCAGCGTC 
GGGGAGAATG 
GGCACCGAGC 
GAAGGTCCTC 
GGTTCTCCCT 



GGTGGCACCC 
GTACGAGCCA 
GCAACGGACC 
GACGACTGGC 
TTCCGGCAGC 
CGCGCAAACC 
GCCATCCGGC 
TACAACAGCG 
GTGGCGCAGC 
GCCTTCCTGA 
GTGAGCGCAC 
CCAGACTATT 
TTCAAAAACT 
TCTAGCTTGC 
AGTGGCAGCG 
ATAGGTCAGG 
CTGGGGCAGG 
CGGCAGAAGA 
GTGCAGCAGA 
GACATGACCG 
CGCCTAATGG 
ATCTTGAACC 
GAGGGTAACG 
CAGACCCTGC 
TTCCGCAGGC 
AGCCCATTTC 
CTGGGCGAGG 
CCTCCGGCAT 
ACGTACGCGC 
CACGACCGTC 
CTGGATTTGG 
TTTTAAAAAA 
GTTGGTTTTC 
CTCCCTCCTA 
TCGATGCTCC 



GCGCGCGCTG 
GAGGACGGCG 
CGGCGGTGCG 
GCCAGGTCAT 
AGCCGCAGGC 
CCACGCACGA 
CCGACGAGGC 
GCAACGTGCA 
GTGAGCGCGC 
GTACACAGCC 
TGCGGCTAAT 
TTTTCCAGAC 
TGCAGGGGCT 
TGACGCCCAA 
TGTCCCGGGA 
CGCATGTGGA 
AGGACACGGG 
TCCCCTCGTT 
GCGTGAGCCT 
CGCGCAACAT 
ACTACTTGCA 
CGCACTGGCT 
ATGGATTCCT 
TAGAGTTGCA 
CAAGCAGCTT 
CAAGCTTGAT 
AGGAGTACCT 
TTCCCAACAA 
AGGAGCACAG 
AGCGGGGTCT 
GAGGGAGTGG 
AAAAAAGCAT 
TTGTATTCCC 
CGAGAGTGTG 
CCTGGACCCG 



PCT/US97/ 19541 

GCAACGTCGG 13 740 

AGTACTAAGC 13 800 

GGCGGCGCTG 13 860 

GGACCGCATC 13 920 

CAACCGGCTC 13 980 

GAAGGTGCTG 14 040 

CGGCCTGGTC 14100 

GACCAACCTG 14 16 0 

GCAGCAGCAG 14 220 

CGCCAACGTG 14 28 0 

GGTG ACTG AG 14 3 4 0 

CAGTAGAC AA 144 0 0 

GTGGGGGGTG 14 4 6 0 

CTCGCGCCTG 14 52 0 

C ACATACCTA 14 5 8 0 

CG AG CAT ACT 14 6 4 0 

CAGCCTGGAG 14 700 

GCACAGTTTA 14 760 

TAACCTGATG 14 82 0 

GGAACCGGGC 14 88 0 

TCGCGCGGCC 14 940 

ACCGCCCCCT 15 00 0 

CTGGGACGAC 15 060 

ACAGCGCGAG 1512 0 

GTCCGATCTA 1518 0 

AGGGTCTCTT 1524 0 

AAACAACTCG 15 3 0 0 

CGGGATAGAG 15 36 0 

GGACGTGCCA 15420 

GGTGTGGGAG 1548 0 

CAACCCGTTT 15 540 

GATGCAAAAT 156 00 

CTTAGTATGC 15660 

GTGAGCGCGG 15720 

CCGTTTGTGC 1578 0 
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CTCCGCGGTA 


CCTGCGC;^^^ 


) 

\_ 'wi o O \j A 


GAAACAGCAT 


CCGTTACTCT 


PCT/US97/19541 

' GAGTTGGCAC 15 840 


CCCTATTCGA 


^ w V- V-i \— . L 




TGGACAACAA 


GTCAACGGAT 


GTGGCATCCC 


15900 


TGAACTACCA 


A A pn a rr* a r* 


ALaLAACTTTC 


TGACCACGGT 


CATTCAAAAC 


AATGACTACA 


15960 


GCCCGGGGGA 


GGCAAHPflPi 




ATCTTGACGA 


CCGGTCGCAC 


TGGGGCGGCG 


16020 


ACCTGAAAAC 




AU L AACATGC 


CAAATGTGAA 


CGAGTTCATG 


TTTACC.AJVTA 


16080 


AGTTTAAGGC 


GCGGGTnATn 




TGCCTACTAA 


GGACAATCAG 


GTGGAGCTGA 


16140 


AATACGAGTG 




AUOt- 1 LjLv_t_G 


AGGGCAACTA 


CTCCGAGACC 


ATGACCATAG 


16200 


ACCTTATGAA 




Li 1 tjLiA(jC_ACT 


ACTTG AAAG T 


GGGCAGACAG 


AACGGGGTTC 


16260 


TGGAAAGCGA 




AALj ill UAU A 


CCCGCAACTT 


CAGACTGGGG 


TTTGACCCCG 


16320 


TCACTGGTCT 


TGTCATGCPT 


VjkjtjVj 1 A i A i A 


LAAACGAAGC 


CTTCCATCCA 


GACATCATTT 


16380 


TGCTGCCAGG 


ATGPGGGGTn 


p A P' I "rr^ A f^cT"^ 


ACAGCCGCCT 


GAGCAACTTG 


TTGGGCATCC 


16440 


GCAAGCGGCA 


ACPPTTPPAP 


oALjVjOL-I i iA 


GGATCACCTA 


CGATGATCTG 


GAGGGTGGTA 


16500 


■ ACATTCCCGC 




V? i \jOAL-Lji_C i 


ACCAGGCGAG 


CTTGAAAGAT 


GACACCGAAC 


16560 


AGGGCGGGGG 


TGGPnPAPPP 


V7 L. Avj C AA k_ A 


GCAGTGGCAG 


CGGCGCGGAA 


GAGAACTCCA 


16620 


ACGCGGCAGC 


PGPPn PA ATP 




AGGACATGAA 


CGATCATGCC 


ATTCGCGGCG 


16680 


ACACCTTTGC 


PAPAPPPPPT 


Vj AL> Lj Avj AACj C 


GCGCTGAGGC 


CGAAGCAGCG 


GCCGAAGCTG 


16740 


CCGCCCCCGC 


TPPPP A APPP 


(jAGCj 1 CGAGA 


AGCCTCAGAA 


GAAACCGGTG 


ATCAAACCCC 


16800 


TGACAGAGGA 


PAPPAAP A A A 


^-L»(^AG 1 1AV_A 


ACCTAATAAG 


CAATGACAGC 


ACCTTCACCC 


16860 


AGTAPPHfAr; 

^^yj -L ^ \w j~vvj 


PTTT^T a rr^T"!* 


CjCATACAACT 


ACGGCGACCC 


TCAGACCGGA 


ATCCGCTCAT 


16920 




i i o^^j-V\_ i i 


(jACGTAACCT 


GCGGCTCGGA 


GCAGGTCTAC 


TGGTCGTTGC 


16980 


c An A r A T^r; a T 




GTGACCTTCC 


GCTCCACGCG 


CCAGATCAGC 


AACTTTCCGG 


17040 


TGGTGGGCGC 


PP APPTnTTT* 


H-CCj 1 GCACT 


CCAAGAGCTT 


CTACAACGAC 


CAGGCCGTCT 


17100 


ACTCCC AACT 




i i lACCTCTC 


TGACCCACGT 


GTTCAATCGC 


TTTCCCGAGA 


17160 


A C C AG A T T^'T' T 




uCACjCCCCCA 


CCATCACCAC 


CGTCAGTGAA 


AACGTTCCTG 


17220 


CTCTCACAGA 




CTACCGCTGC 


GCAACAGCAT 


CGGAGGAGTC 


CAGCGAGTGA 


17280 


CCATTACTr;A 




uCjCACCTGCC 


CCTACGTTTA 


CAAGGCCCTG 


GGCATAGTCT 


17340 


CGCCGCGCGT 


PPTQ TTT* A rr* 


TV 1 tf 1 If 1 If nf 1 1 

C-GCACTTTTT 


GAGCAAGCAT 


GTCCATCCTT 


ATATCGCCCA 


17400 


G C AAT AA CAP 




uTGCGCTTCC 


CAAGCAAGAT 


GTTTGGCGGG 


GCCAAGAAGC 


17460 


GCTCCGACCA 


AP A r'f~*r^Tir"vr' 


v-GCGTGCGCG 


GGCACTACCG 


CGCGCCCTGG 


GGCGCGCACA 


17S20 


AACGCGGCCG 


CACTCzczn.rTT' 


av„LALCGTCG 


ATGACGCCAT 


CGACGCGGTG 


GTGGAGGAGG 


17580 


CGCGCAACTA 


P A ccc^c a c^r* 


LCGCCACCAG 


TGTCCACAGT 


GGACGCGGCC 


ATTCAGACCG 


17640 


TGGTGCGCGG 


V„ ^ \_ L9*J ^ O V*. 


1 ATGCTAAAA 


TGAAGAGACG 


GCGGAGGCGC 


GTAGCACGTC 


17700 


>^ W %J \w \J 


V- v_ L. u Li v_ 


ACTGCCGCCC 


AACGCGCGGC 


GGCGGCCCTG 


CTTAACCGCG 


17760 


CACGTCGCAC 


PPPPPP ArTT* 
^ O Vj v_ V_ La v_ Vj Lj 


^^^^ TV ^^^^ 


GGGCCGCTCG 


AAGGCTGGCC 


GCGGGTATTG 


17820 




CCCCAGGTCC 


AGGCGACGAG 


CGGCCGCCGC 


AGCAGCCGCG 


GCCATTAGTG 


17880 
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CTATGACTCA 

GCGTGCCCGT 
CGTACTGTTG 
TCAAAGAAGA 
AGCAGGATTA 
ATGAACTTGA 
GGAAAGGTCG 
GTGAGCGCTC 
TGCTTGAGCA 
TGCTGGCGTT 
AGCAGGTGCT 
GTGACTTGGC 
TGGAAAAAAT 
AGGTGGCGCC 
CCAGTATTGC 
TGGCGGATGC 
AAACGGACCC 
ACGGCGCCGC 
CCGGCTATCG 
CCACTGGAAC 
GCAGGGTGGC 
GCATCGTTTA 
GTTTCCCGGT 
GCCTGACGGG 
TGCGCGGCGG 
CCGGAATTGC 
GGAAAAATCA 
AGAATGGAAG 
GGAAACTGGC 
CTGTGGAGCG 
AACAGCAGCA 
GTGGTAGATG 
CAAAATAAGA 
GTGGAGACAG 
ACTCTGGTGA 



gggtcgWgg 
gcgcacccgc 
tatgtatcca 
gatgctccag 
caagccccga 
cgacgaggtg 
acgcgtaaaa 
cacccgcacc 
ggccaacgag 
gccgctggac 
gcccgcgctt 
acccaccgtg 
gaccgtggaa 
gggactgggc 
caccgccaca 
cgcggtgcag 
gtggatgttt 
cagcgcgcta 
tggctacacc 
ccgccgccgc 
tcgcgaagga 
aaagccggtc 
gccgggattc 
cggcatgcgt 
tatcctgccc 
atccgtggcc 
aaataaaaag 
acatcaactt 
aagatatcgg 

GCATTAAAAA 
CAGGCCAGAT 
GCCTGGCCTC 
TTAACAGTAA 
TGTCTCCAGA 
CGCAAATAGA 



GGCAACGTGT 
CCCCCGCGCA 
GCGGCGGCGG 
GTCATCGCGC 
AAGCTAAAGC 
GAACTGCTGC 
CGTGTTTTGC 
TACAAGCGCG 
CGCCTCGGGG 
GAGGGCAACC 
GCACCGTCCG 
CAGCTGATGG 
CCTGGGCTGG 
GTGCAGACCG 
GAGGGCATGG 
GCGGTCGCTG 
CGCGTTTCAG 
CTGCCCGAAT 
TACCGCCCCA 
CGTCGCCGTC 
GGCAGGACCC 
TTTGTGGTTC 
CGAGGAAGAA 
CGTGCGCACC 
CTCCTTATTC 
TTGCAGGCGC 
TCTGGACTCT 
TGCGTCTCTG 
CACCAGCAAT 
TTTCGGTTCC 
GCTGAGGGAT 
TGGCATTAGC 
GCTTGATCCC 
GGGGCGTGGC 
CGAGCCTCCC 



ATTGGGTGCG 
ACTAGATTGC 
CGCGCAACGA 
CGGAGATCTA 
GGGTCAAAAA 
ACGCTACCGC 
GACCCGGCAC 
TGTATGATGA 
AGTTTGCCTA 
CAACACCTAG 
AAGAAAAGCG 
TACCCAAGCG 
AGCCCGAGGT 
TGGACGTTCA 
AGACACAAAC 
CGGCCGCGTC 
CCCCCCGGCG 
ATGCCCTACA 
GAAGACGAGC 
GCCAGCCCGT 
TGGTGCTGCC 
TTGCAGATAT 
TGCACCGTAG 
ACCGGCGGCG 
CACTGATCGC 
AGAGACACTG 
CACGCTCGCT 
GCCCCGCGAC 
ATGAGCGGTG 
ACCGTTAAGA 
AAGTTGAAAG 
GGGGTGGTGG 
CGCCCTCCCG 
GAAAAGCGTC 
TCGTACGAGG 



CGACTCGGTT 
AAGAAAAAAC 
AGCTATGTCC 
TGGCCCCCCG 
GAAAAAGAAA 
GCCCAGGCGA 
CACCGTAGTC 
GGTGTACGGC 
CGGAAAGCGG 
CCTAAAGCCC 
CGGCCTAAAG 
CCAGCGACTG 
CCGCGTGCGG 
GATACCCACT 
GTCCCCGGTT 
CAAGACCTCT 
CCCGCGCGGT 
TCCTTCCATT 
AACTACCCGA 
GCTGGCCCCG 
AACAGCGCGC 
GGCCCTCACC 
GAGGGGCATG 
GCGCGCGTCG 
CGCGGCGATT 
ATTAAAAACA 
TGGTCCTGTA 
ACGGCTCGCG 
GCGCCTTCAG 
ACTATGGCAG 
AGCAAAATTT 
ACCTGGCCAA 
TAGAGGAGCC 
CGCGCCCCGA 
AGGCACTAAA 



PCT/US97/19541 

AGCGGCCTGC 17940 

TACTTAGACT 18 000 

AAGCGCAAAA 18 060 

AAG AAGG AAG 1312 0 

GATGATGATG 1818 0 

CGGGTACAGT 18 240 

TTTACGCCCG 18300 

GACGAGGACC 1836 0 

CATAAGGACA 184 2 0 

GTAACACTGC 184 8 0 

CGCGAGTCTG 18 540 

GAAGATGTCT 18600 

CCAATCAAGC 1866 0 

ACCAGTAGCA 1872 0 

GCCTCAGCGG 18 780 

ACGGAGGTGC 1884 0 

TCGAGGAAGT 18 900 

GCGCCTACCC 18 960 

CGCCGAACCA 19020 

ATTTCCGTGC 19080 

TACCACCCCA 1914 0 

TGCCGCCTCC 19200 

GCCGGCCACG 19260 

CACCGTCGCA 19320 

GGCGCCGTGC 19380 

AGTTGCATGT 19440 

ACTATTTTGT 19500 

CCCGTTCATG 19 56 0 

CTGGGGCTCG 19620 

CAAGGCCTGG 19680 

CCAACAAAAG 19740 

CCAGGCAGTG 198 00 

TCCACCGGCC 19860 

CAGGGAAGAA 19920 

GCAAGGCCTG 19 980 
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CCCACCACCC 


gtcccatW? 


) 

GCCCATGGCT 


ACCGGAGTGC 


TGGGCCAGCA 


PCT/TJS97/I9541 

CACACCCGTA 2 0040 


ACGCTGGACC 


TGCCTCCCCC 


CGCCGACACC 


CAGCAGAAAC 


CTGTGCTGCC 


AGGCCCGACC 


20100 


GCCGTTGTTG 


taacccgtcc 


TAGCCGCGCG 


TCCCTGCGCC 


GCGCCGCCAG 


CGGTCCGCGA 


20160 


TCGTTGCGGC 


ccgtagccag 


TGGCAACTGG 


CAAAGCACAC 


TGAACAGCAT 


CGTGGGTCTG 


20220 


GGGGTGCAAT 


ccctgaagcg 


CCGACGATGC 


TTCTGAATAG 


CTAACGTGTC 


GTATGTGTGT 


20280 


. CATGTATGCG 


tccatgtcgc 


CGCCAGAGGA 


GCTGCTGAGC 


CGCCGCGCGC 


CCGCTTTCCA 


20340 


AGATGGCTAC 


cccttcgatg 


ATGCCGCAGT 


GGTCTTACAT 


GCACATCTCG 


GGCCAGGACG 


20400 


CCTCGGAGTA 


cctgagcccc 


GGGCTGGTGC 


AGTTTGCCCG 


CGCCACCGAG 


ACGTACTTCA 


20460 


GCCTGAATAA 


CAAGTTTAGA 


AACCCCACGG 


TGGCGCCTAC 


GCACGACGTG 


ACCACAGACC 


20520 


GGTCCCAGCG 


tttgacgctg 


CGGTTCATCC 


CTGTGGACCG 


TGAGGATACT 


GCGTACTCGT 


20580 


ACAAGGCGCG 


GTTCACCCTA 


GCTGTGGGTG 


ATAACCGTGT 


GCTGGACATG 


GCTTCCACGT 


20640 


ACTTTGACAT 


CCGCGGCGTG 


CTGGACAGGG 


GCCCTACTTT 


TAAGCCCTAC 


TCTGGCACTG 


20700 


CCTACAACGC 


CCTGGCTCCC 


AAGGGTGCCC 


CAAATCCTTG 


CGAATGGGAT 


GAAGCTGCTA 


20760 



CTGCTCTTGA AATAAACCTA GAAGAAGAGG ACGATGACAA CGAAGACGAA GTAGACGAGC 
AAGCTGAGCA GCAAAAAACT CACGTATTTG GGCAGGCGCC TTATTCTGGT ATAAATATTA 
CAAAGGAGGG TATTCAAATA GGTGTCGAAG GTCAAACACC TAAATATGCC GATAAAACAT 
TTCAACCTGA ACCTCAAATA GGAGAATCTC AGTGGTACGA AACTGAAATT AATCATGCAG 
CTGGGAGAGT CCTTAAAAAG ACTACCCCAA TGAAACCATG TTACGGTTCA TATGCAAAAC 
CCACAAATGA AAATGGAGGG CAAGGCATTC TTGTAAAGCA ACAAAATGGA AAGCTAGAAA 
GTCAAGTGGA .^U^TGCAATTT TTCTCAACTA CTGAGGCGAC CGCAGGCAAT GGTGATAACT 
TGACTCCTAA AGTGGTATTG TACAGTGAAG ATGTAGATAT AGAAACCCCA GACACTCATA 
TTTCTTACAT GCCCACTATT AAGGAAGGTA ACTCACGAGA ACTAATGGGC CAACAATCTA 
TGCCCAACAG GCCTAATTAC ATTGCTTTTA GGGACAATTT TATTGGTCTA ATGTATTACA 
ACAGCACGGG TAATATGGGT GTTCTGGCGG GCCAAGCATC GCAGTTGAAT GCTGTTGTAG 
ATTTGCAAGA CAGAAACACA GAGCTTTCAT ACCAGCTTTT GCTTGATTCC ATTGGTGATA 
vGAACCAGGTA CTTTTCTATG TGGAATCAGG CTGTTGACAG CTATGATCCA GATGTTAGAA 
TTATTGAAAA TCATGGAACT GAAGATGAAC TTCCAAATTA CTGCTTTCCA CTGGGAGGTG 
TGATTAATAC AGAGACTCTT ACCAAGGTAA AACCTAAAAC AGGTCAGGAA AATGGATGGG 
AAAAAGATGC TACAGAATTT TCAGATAAAA ATGAAATAAG AGTTGGAAAT AATTTTGCCA 
TGGAAATCAA TCTAAATGCC AACCTGTGGA GAAATTTCCT GTACTCCAAC ATAGCGCTGT 
ATTTGCCCGA CAAGCTAAAG TACAGTCCTT CCAACGTAAA AATTTCTGAT AACCCAAACA 
CCTACGACTA CATGAACAAG CGAGTGGTGG CTCCCGGGTT AGTGGACTGC TACATTAACC 
TTGGAGCACG CTGGTCCCTT GACTATATGG ACAACGTCAA CCCATTTAAC CACCACCGCA 
ATGCTGGCCT GCGCTACCGC TCAATGTTGC TGGGCAATGG TCGCTATGTG CCCTTCCACA 
TCCAGGTGCC TCAGAAGTTC TTTGCCATTA AAAACCTCCT TCTCCTGCCG GGCTCATACA 
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CCTACGAGTG GAACTTCA^ AAGGATGTTA ACATGGTTCT GCAGAGC^^ CTAGGAAATG 22140 

ACCTAAGGGT TGACGGAGCC AGCATTAAGT TTGATAGCAT TTGCCTTTAC GCCACCTTCT 22200 

TCCCCATGGC CCACAACACC GCCTCCACGC TTGAGGCCAT GCTTAGAAAC GACACCAACG 2 2260 

ACCAGTCCTT TAACGACTAT CTCTCCGCCG CCAACATGCT CTACCCTATA CCCGCC.^ACG 22320 

CTACCAACGT GCCCATATCC ATCCCCTCCC GCAACTGGGC GGCTTTCCGC GGCTGGGCCT 223 80 

TCACGCGCCT TAAGACTAAG GAAACCCCAT CACTGGGCTC GGGCTACGAC CCTTATTACA 2 2440 

CCTACTCTGG CTCTATACCC TACCTAGATG GAACCTTTTA CCTCAACCAC ACCTTTAAGA 22 500 

AGGTGGCCAT TACCTTTGAC TCTTCTGTCA GCTGGCCTGG CAATGACCGC CTGCTTACCC 2 2 560 

CCAACGAGTT TGAAATTAAG CGCTCAGTTG ACGGGGAGGG TTACAACGTT GCCCAGTGTA 2 2620 

ACATGACCAA AGACTGGTTC CTGGTACAAA TGCTAGCTAA CTACAACATT GGCTACCAGG 2 26 80 

GCTTCTATAT CCCAGAGAGC TACAAGGACC GCATGTACTC CTTCTTTAGA AACTTCCAGC 2 2 740 

CCATGAGCCG TCAGGTGGTG GATGATACTA AATACAAGGA CTACCAACAG GTGGGCATCC 2 2800 

TACACCAACA CAACAACTCT GGATTTGTTG GCTACCTTGC CCCCACCATG CGCGAAGGAC 2 2 860 

AGGCCTACCC TGCTAACTTC CCCTATCCGC TTATAGGCAA GACCGCAGTT GACAGCATTA 2 2 920 

CCCAGAAAAA GTTTCTTTGC GATCGCACCC TTTGGCGCAT CCCATTCTCC AGTAACTTTA 2 2 980 

TGTCCATGGG CGCACTCACA GACCTGGGCC AAAACCTTCT CTACGCCAAC TCCGCCCACG 2 3 040 

CGCTAGACAT GACTTTTGAG GTGGATCCCA TGGACGAGCC CACCCTTCTT TATGTTTTGT 2 3100 

TTGAAGTCTT TGACGTGGTC CGTGTGCACC GGCCGCACCG CGGCGTCATC GAAACCGTGT 2 316 0 

ACCTGCGCAC GCCCTTCTCG GCCGGCAACG CCACAACATA AAGAAGCAAG CAACATCAAC 2 3 220 

AACAGCTGCC GCCATGGGCT CCAGTGAGCA GGAACTGAAA GCCATTGTCA AAGATCTTGG 2 3 280 

TTGTGGGCCA TATTTTTTGG GCACCTATGA CAAGCGCTTT CCAGGCTTTG TTTCTCCACA 2 3 340 

CAAGCTCGCC TGCGCCATAG TCAATACGGC CGGTCGCGAG ACTGGGGGCG TACACTGGAT 2 3 400 

GGCCTTTGCC TGGAACCCGC ACTCAAAAAC ATGCTACCTC TTTGAGCCCT TTGGCTTTTC 2 3 460 

TGACCAGCGA CTCAAGCAGG TTTACCAGTT TGAGTACGAG TCACTCCTGC GCCGTAGCGC 2 3 520 

CATTGCTTCT TCCCCCGACC GCTGTATAAC GCTGGAAAAG TCCACCCAAA GCGTACAGGG 2 3 580 

GCCCAACTCG GCCGCCTGTG GACTATTCTG CTGCATGTTT CTCCACGCCT TTGCCAACTG 2 3 640 

GCCCCAAACT CCCATGGATC ACAACCCCAC CATGAACCTT ATTACCGGGG TACCCAACTC 2 3 700 

CATGCTCAAC AGTCCCCAGG TACAGCCCAC CCTGCGTCGC AACCAGGAAC AGCTCTACAG 23 760 

CTTCCTGGAG CGCCACTCGC CCTACTTCCG CAGCCACAGT GCGCAGATTA GGAGCGCCAC 2 3 820 

TTCTTTTTGT CACTTGAAAA ACATGTAAAA ATAATGTACT AGAGACACTT TCAATAAAGG 2 3 880 

CAAATGCTTT TATTTGTACA CTCTCGGGTG ATTATTTACC CCCACCCTTG CCGTCTGCGC 2 3 940 
CGTTTAAAAA TCAAAGGGGT TCTGCCGCGC ATCGCTATGC GCCACTGGCA GGGACACGTT 24 000 
GCGATACTGG TGTTTAGTGC TCCACTTTVAA CTCAGGCACA ACCATCCGCG GCAGCTCGGT 2 4 060 
GAAGTTTTCA CTCCACAGGC TGCGCACCAT CACCAACGCG TTTAGCAGGT CGGGCGCCGA 2412 0 
TATCTTGAAG TCGCAGTTGG GGCCTCCGCC CTGCGCGCGC GAGTTGCGAT ACACAGGGTT 2 418 0 
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GCAGCACTGG AACACTW^A GCGCCGGGTG GTGCACGCTG GCCAGCACGC TCTTGTCGGA 
GATCAGATCC GCGTCCAGGT CCTCCGCGTT GCTCAGGGCG AACGGAGTCA ACTTTGGTAG 
CTGCCTTCCC AAAAAGGGCG CGTGCCCAGG CTTTGAGTTG CACTCGCACC GTAGTGGCAT 

caaaaggtga ccgtgcccgg tctgggcgtt aggatacagc gcctgcataa aagccttgat 

CTGCTTAAAA GCCACCTGAG CCTTTGCGCC TTCAGAGAAG AACATGCCGC AAGACTTGCC 
GGAAAACTGA TTGGCCGGAC AGGCCGCGTC GTGCACGCAG CACCTTGCGT CGGTGTTGGA 
GATCTGCACC ACATTTCGGC CCCACCGGTT CTTCACGATC TTGGCCTTGC TAGACTGCTC 
CTTCAGCGCG CGCTGCCCGT TTTCGCTCGT CACATCCATT TCAATCACGT gctccttatt 
TATCATAATG CTTCCGTGTA GACACTTAAG CTCGCCTTCG ATCTCAGCGC AGCGGTGCAG 
CCACAACGCG CAGCCCGTGG GCTCGTGATG CTTGTAGGTC ACCTCTGCAA ACGACTGCAG 
GTACGCCTGC AGGAATCGCC CCATCATCGT CACAAAGGTC TTGTTGCTGG TGAAGGTCAG 
CTGCAACCCG CGGTGCTCCT CGTTCAGCCA GGTCTTGCAT ACGGCCGCCA GAGCTTCCAC 
TTGGTCAGGC AGTAGTTTGA AGTTCGCCTT TAGATCGTTA TCCACGTGGT ACTTGTCCAT 
CAGCGCGCGC GCAGCCTCCA TGCCCTTCTC CCACGCAGAC ACGATCGGCA CACTCAGCGG 
GTTCATCACC GTAATTTCAC TTTCCGCTTC GCTGGGCTCT TCCTCTTCCT CTTGCGTCCG 
CATACCACGC GCCACTGGGT CGTCTTCATT CAGCCGCCGC ACTGTGCGCT TACCTCCTTT 
GCCATGCTTG ATTAGCACCG GTGGGTTGCT GAAACCCACC ATTTGTAGCG CCACATCTTC 
TCTTTCTTCC TCGCTGTCCA CGATTACCTC TGGTGATGGC GGGCGCTCGG GCTTGGGAGA 
AGGGCGCTTC TTTTTCTTCT TGGGCGCAAT GGCCAAATCC GCCGCCGAGG TCGATGGCCG 
CGGGCTGGGT GTGCGCGGCA CCAGCGCGTC TTGTGATGAG TCTTCCTCGT CCTCGGACTC 
GATACGCCGC CTCATCCGCT TTTTTGGGGG CGCCCGGGGA GGCGGCGGCG ACGGGGACGG 
GGACGACACG TCCTCCATGG TTGGGGGACG TCGCGCCGCA CCGCGTCCGC GCTCGGGGGT 
GGTTTCGCGC TGCTCCTCTT CCCGACTGGC CATTTCCTTC TCCTATAGGC AGAAAAAGAT 
CATGGAGTCA GTCGAGAAGA AGGACAGCCT AACCGCCCCC TCTGAGTTCG CCACCACCGC 
CTCCACCGAT GCCGCCAACG CGCCTACCAC CTTCCCCGTC GAGGCACCCC CGCTTGAGGA 
GGAGGA.:^GTG ATTATCGAGC AGGACCCAGG TTTTGTAAGC GAAGACGACG AGGACCGCTC 
AGTACCAACA GAGGATAAAA AGCAAGACCA GGACAACGCA GAGGCAAACG AGGAACAAGT 
CGGGCGGGGG GACGAAAGGC ATGGCGACTA CCTAGATGTG GGAGACGACG TGCTGTTGAA 
GCATCTGCAG CGCCAGTGCG CCATTATCTG CGACGCGTTG CAAGAGCGCA GCGATGTGCC ' 
CCTCGCCATA GCGGATGTCA GCCTTGCCTA CGAACGCCAC CTATTCTCAC CGCGCGTACC 
CCCCAAACGC CAAGAAAACG GCACATGCGA GCCCAACCCG CGCCTCAACT TCTACCCCGT 
ATTTGCCGTG CCAGAGGTGC TTGCCACCTA TCACATCTTT TTCCAAAACT GCAAGATACC 
CCTATCCTGC CGTGCCAACC GCAGCCGAGC GGACAAGCAG CTGGCCTTGC GGCAGGGCGC 
TGTCATACCT GATATCGCCT CGCTCAACGA AGTGCCAAAA ATCTTTGAGG GTCTTGGACG 
CGACGAGAAG CGCGCGGCAA ACGCTCTGCA ACAGGAAAAC AGCGAAAATG AAAGTCACTC 
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TGGAGTGTTG GTGGAA* 




AGGGTGACAA CGCGCGCCTA GCCGTAC 




AACGCAGCAT 
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26340 



CGAGGTCACC CACTTTGCCT ACCCGGCACT TAACCTACCC CCCAAGGTCA TGAGCACAGT 2 64 00 

CATGAGTGAG CTGATCGTGC GCCGTGCGCA GCCCCTGGAG AGGGATGCAA ATTTGCAAGA 2 6460 

ACAAACAGAG GAGGGCCTAC CCGCAGTTGG CGACGAGCAG CTAGCGCGCT GGCTTCAAAC 2 6 520 

GCGCGAGCCT GCCGACTTGG AGGAGCGACG CAAACTAATG ATGGCCGCAG TGCTCGTTAC 2 6 580 

CGTGGAGCTT GAGTGCATGC AGCGGTTCTT TGCTGACCCG GAGATGCAGC GCAAGCTAGA 2 664 0 

GGAAACATTG CACTACACCT TTCGACAGGG CTACGTACGC CAGGCCTGCA AGATCTCCAA 2 6 700 

CGTGGAGCTC TGCAACCTGG TCTCCTACCT TGGAATTTTG CACGAAAACC GCCTTGGGCA 2 6 760 

AAACGTGCTT CATTCCACGC TCAAGGGCGA GGCGCGCCGC GACTACGTCC GCGACTGCGT 2 6 820 

TTACTTATTT CTATGCTACA CCTGGCAGAC GGCCATGGGC GTTTGGCAGC AGTGCTTGGA 2 6880 

GGAGTGCAAC CTCAAGGAGC TGCAGAAACT GCTAAAGCAA AACTTGAAGG ACCTATGGAC 2 6 940 

GGCCTTCAAC GAGCGCTCCG TGGCCGCGCA CCTGGCGGAC ATCATTTTCC CCGAACGCCT 2 7000 

GCTTAAAACC CTGCAACAGG GTCTGCCAGA CTTCACCAGT CAAAGCATGT TGCAGAACTT 2 706 0 

TAGGAACTTT ATCCTAGAGC GCTCAGGAAT CTTGCCCGCC ACCTGCTGTG CACTTCCTAG 2 7120 

CGACTTTGTG CCCATTAAGT ACCGCGAATG CCCTCCGCCG CTTTGGGGCC ACTGCTACCT 2 7180 

TCTGCAGCTA GCCAACTACC TTGCCTACCA CTCTGACATA ATGGAAGACG TGAGCGGTGA 2 724 0 

CGGTCTACTG GAGTGTCACT GTCGCTGCAA CCTATGCACC CCGCACCGCT CCCTGGTTTG 2 73 00 

CAATTCGCAG CTGCTTAACG AAAGTCAAAT TATCGGTACC TTTGAGCTGC AGGGTCCCTC 2 7360 

GCCTGACGAA AAGTCCGCGG CTCCGGGGTT GAAACTCACT CCGGGGCTGT GGACGTCGGC 2 7420 

TTACCTTCGC AAATTTGTAC CTGAGGACTA CCACGCCCAC GAGATTAGGT TCTACGAAGA 2 7480 

CCAATCCCGC CCGCCAAATG CGGAGCTTAC CGCCTGCGTC ATTACCCAGG GCCACATTCT 2 7 540 

TGGCCAATTG CAAGCCATCA ACAAAGCCCG CCAAGAGTTT CTGCTACGAA AGGGACGGGG 2 7600 

GGTTTACTTG GACCCCCAGT CCGGCGAGGA GCTCAACCCA ATCCCCCCGC CGCCGCAGCC 2 766 0 

CTATCAGCAG CAGCCGCGGG CCCTTGCTTC CCAGGATGGC ACCCAAAAAG AAGCTGCAGC 27720 

TGCCGCCGCC ACCCACGGAC GAGGAGGAAT ACTGGGACAG TCAGGCAGAG GAGGTTTTGG 2 7 780 

ACGAGGAGGA GGAGGACATG ATGGAAGACT GGGAGAGCCT AGACGAGGAA GCTTCCGAGG 2 7840 

TCGAAGAGGT GTCAGACGAA ACACCGTCAC CCTCGGTCGC ATTCCCCTCG CCGGCGCCCC 27 900 

AGAAATCGGC AACCGGTTCC AGCATGGCTA CAACCTCCGC TCCTCAGGCG CCGCCGGCAC 2 7 960 

TGCCCGTTCG CCGACCCAAC CGTAGATGGG ACACCACTGG AACCAGGGCC GGTAAGTCCA 2 8 020 

AGCAGCCGCC GCCGTTAGCC CAAGAGCAAC AACAGCGCCA AGGCTACCGC TCATGGCGCG 2 8 080 

GGCACAAGAA CGCCATAGTT GCTTGCTTGC AAGACTGTGG GGGCAACATC TCCTTCGCCC 2 814 0 

GCCGCTTTCT TCTCTACCAT CACGGCGTGG CCTTCCCCCG TAACATCCTG CATTACTACC 2 8200 

GTCATCTCTA CAGCCCATAC TGCACCGGCG GCAGCGGCAG CGGCAGCAAC AGCAGCGGCC 2 8260 

ACACAGAAGC AAAGGCGACC GGATAGCAAG ACTCTGACAA AGCCCAAGAA ATCCACAGCG 2 8 320 

GCGGCAGCAG CAGGAGGAGG AGCGCTGCGT CTGGCGCCCA ACGAACCCGT ATCGACCCGC 28 380 
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GAGCTTAGAA ACAGGaW TCCCACTCTG TATGCTATAT TTCAACAGAG CAGGGGCCAA 29440 

GAACAAGAGC TGAAAATAAA AAACAGGTCT CTGCGATCCC TCACCCGCAG CTGCCTGTAT 28500 

CACAAAAGCG AAGATCAGCT TCGGCGCACG CTGGAAGACG CGGAGGCTCT CTTCAGTAAA ^3560 

TACTGCGCGC TGACTCTTAA GGACTAGTTT CGCGCCCTTT CTCAAATTTA AGCGCGAAAA 28S20 

CTACGTCATC TCCAGCGGCC ACACCCGGCG CCAGCACCTG TCGTCAGCGC CATTATGAGC .8680 

AAGGAAATTC CCACGCCCTA CATGTGGAGT TACCAGCCAC AAATGGGACT TGCGGCTGGA -^3 74 0 

GCTGCCCAAG ACTACTCAAC CCGAATAAAC TACATGAGCG CGGGACCCCA CATGATATCC 28800 

CGGGTCAACG GAATCCGCGC CCACCGAAAC CGAATTCTCT TGGAACAGGC GGCTATTACC 26860 

ACCACACCTC GTAATAACCT TAATCCCCGT AGTTGGCCCG CTGCCCTGGT GTACCAGGAA ,8920 

AGTCCCGCTC CCACCACTGT GGTACTTCCC AGAGACGCCC AGGCCGAAGT TCAGATGACT 28980 
AACTCAGGGG CGCAGCTTGC GGGCGGCTTT CGTCACAGGG TGCGGTCGCC CGGGCAGGGT 
ATAACTCACC TGACAATCAG AGGGCGAGGT ATTCAGCTCA ACGACGAGTC GGTGAGCTCC 
TCGCTTGGTC TCCGTCCGGA CGGGACATTT CAGATCGGCG GCGCCGGCCG TCCTTCATTC 
ACGCCTCGTC AGGCAATCCT AACTCTGCAG ACCTCGTCCT CTGAGCCGCG CTCTGGAGGC 
ATTGGAACTC TGCAATTTAT TGAGGAGTTT GTGCCATCGG TCTACTTT.AA CCCCTTCTCG 
GGACCTCCCG GCCACTATCC GGATCAATTT ATTCCTAACT TTGACGCGGT AAAGGACTCG 

GCGGACGGCT ACGACTGAAT GTTAATT.AAG TTCCTGTCCA TCCGCACCCA CTATCTTCAT 29400 

GTTGTTGCAG ATGAAGCGCG CAAGACCGTC TGAAGATACC TTCAACCCCG TGTATCCATA 29460 

TGACACGGAA ACCGGTCCTC CAACTGTGCC TTTTCTTACT CCTCCCTTTG TATCCCCCAA 29S20 

TGGGTTTCAA GAGAGTCCCC CTGGGGTACT CTCTTTGCGC CTATCCGAAC CTCTAGTTAC 29580 

CTCCAATGGC ATGCTTGCGC TCAAAATGGG CAACGGCCTC TCTCTGGACG AGGCCGGCAA 29640 

CCTTACCTCC CAAAATGTAA CCACTGTGAG CCCACCTCTC AAAAAAACCA AGTCAAACAT 29700 

AAACCTGGAA ATATCTGCAC CCCTCACAGT TACCTCAGAA GCCCTAACTG TGGCTGCCGC 29760 

CGCACCTCTA ATGGTCGCGG GCAACACACT CACCATGCAA TCACAGGCCC CGCTAACCGT 29820 

GCACGACTCC AAACTTAGCA TTGCCACCCA AGGACCCCTC ACAGTGTCAG AAGGAAAGCT 29880 
AGCCCTGCAA ACATCAGGCC CCCTCACCAC CACCGATAGC AGTACCCTTA CTATCACTGC 
CTCACCCCCT CTAACTACTG CCACTGGTAG CTTGGGCATT GACTTGAAAG AGCCCATTTA 
TACACAAAAT GGAAAACTAG GACTAAAGTA CGGGGCTCCT TTGCATGTAA CAGACGACCT 
AAACACTTTG ACCGTAGCAA CTGGTCCAGG TGTGACTATT AATAATACTT CCTTGCAAAC 

TAAAGTTACT GGAGCCTTGG GTTTTGATTC ACAAGGCAAT ATGCAACTTA ATGTAGCAGG 30180 

AGGACTAAGG ATTGATTCTC AAAACAGACG CCTTATACTT GATGTTAGTT ATCCGTTTGA 30240 

TGCTCAAAAC CAACTAAATC TAAGACTAGG ACAGGGCCCT CTTTTTATAA ACTCAGCCCA 30300 

CAACTTGGAT ATTAACTACA ACAAAGGCCT TTACTTGTTT ACAGCTTCAA ACAATTCCAA 30360 

AAAGCTTGAG GTTAACCTAA GCACTGCCAA GGGGTTGATG TTTGACGCTA CAGCCATAGC 30420 
CATTAATGCA GGAGATGGGC TTGAATTTGG TTCACCTAAT GCACCAAACA CAAATCCCCT 
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CAAAACAAAA ATTGGCC^^ GCCTAGAATT TGATTCAAAC AAGGCTAT^G TTCCTAAACT 3 0540 

AGGAACTGGC CTTAGTTTTG ACAGCACAGG TGCCATTACA GTAGGAAACA AAAATAATGA 3 06 00 

TAAGCTAACT TTGTGGACCA CACCAGCTCC ATCTCCTAAC TGTAGACTAA ATGCAGAGAA 3 0660 

AGATGCTAAA CTCACTTTGG TCTTAACAAA ATGTGGCAGT CAAATACTTG CTACAGTTTC 3 0720 

AGTTTTGGCT GTTAAAGGCA GTTTGGCTCC AATATCTGGA ACAGTTCAAA GTGCTCATCT 3 0780 

TATTATAAGA TTTGACGAAA ATGGAGTGCT ACTAAACAAT TCCTTCCTGG ACCCAGAATA 3 084 0 

TTGGAACTTT AGAAATGGAG ATCTTACTGA AGGCACAGCC TATACAAACG CTGTTGGATT 3 0 900 

TATGCCTAAC CTATCAGCTT ATCCAAAATC TCACGGTAAA ACTGCCAAAA GTAACATTGT 3 0960 

CAGTCAAGTT TACTTAAACG GAGACAAAAC TAAACCTGTA ACACTAACCA TTACACTAAA 310 20 

CGGTACACAG GAAACAGGAG ACACAACTCC AAGTGCATAC TCTATGTCAT TTTCATGGGA 310 8 0 

CTGGTCTGGC CACAACTACA TTAATGAAAT ATTTGCCACA TCCTCTTACA CTTTTTCATA 3114 0 

CATTGCCCAA GAATAAAGAA TCGTTTGTGT TATGTTTCAA CGTGTTTATT TTTCAATTGC 312 0 0 

AGAAAATTTC AAGTCATTTT TCATTCAGTA GTATAGCCCC ACCACCACAT AGCTTATACA 312 6 0 

GATCACCGTA CCTTAATCAA ACTCACAGAA CCCTAGTATT CAACCTGCCA CCTCCCTCCC 313 2 0 

AACACACAGA GTACACAGTC CTTTCTCCCC GGCTGGCCTT AAAAAGCATC ATATCATGGG 313 8 0 

TAACAGACAT ATTCTTAGGT GTTATATTCC ACACGGTTTC CTGTCGAGCC AAACGCTCAT 314 4 0 

CAGTGATATT AATAAACTCC CCGGGCAGCT CACTTAAGTT CATGTCGCTG TCCAGCTGCT 315 00 

GAGCCACAGG CTGCTGTCCA ACTTGCGGTT GCTTAACGGG CGGCGAAGGA GAAGTCCACG 31560 

CCTACATGGG GGTAGAGTCA TAATCGTGCA TCAGGATAGG GCGGTGGTGC TGCAGCAGCG 316 2 0 

CGCGAATAAA CTGCTGCCGC CGCCGCTCCG TCCTGCAGGA ATACAACATG GCAGTGGTCT 316 8 0 

CCTCAGCGAT GATTCGCACC GCCCGCAGCA TAAGGCGCCT TGTCCTCCGG GCACAGCAGC 31740 

GCACCCTGAT CTCACTTAAA TCAGCACAGT AACTGCAGCA CAGCACCACA ATATTGTTCA 318 00 

AAATCCCACA GTGCAAGGCG CTGTATCCAA AGCTCATGGC GGGGACCACA GAACCCACGT 31860 

GGCCATCATA CCACAAGCGC AGGTAGATTA AGTGGCGACC CCTCATAAAC ACGCTGGACA 31920 

TAAACATTAC CTCTTTTGGC ATGTTGTAAT TCACCACCTC CCGGTACCAT ATAAACCTCT 31980 

GATTAAACAT GGCGCCATCC ACCACCATCC TAAACCAGCT GGCCAAAACC TGCCCGCCGG 3 2 040 

CTATACACTG CAGGGAACCG GGACTGGAAC AATGACAGTG GAGAGCCCAG GACTCGTAAC 32100 

CATGGATCAT CATGCTCGTC ATGATATCAA TGTTGGCACA ACACAGGCAC ACGTGCATAC 3 216 0 

ACTTCCTCAG GATTACAAGC TCCTCCCGCG TTAGAACCAT ATCCCAGGGA ACAACCCATT 32220 

CCTGAATCAG CGTAAATCCC ACACTGCAGG GAAGACCTCG CACGTAACTC ACGTTGTGCA 3 228 0 

TTGTCAAAGT GTTACATTCG GGCAGCAGCG GATGATCCTC CAGTATGGTA GCGCGGGTTT 3 2 340 

CTGTCTCAAA AGGAGGTAGA CGATCCCTAC TGTACGGAGT GCGCCGAGAC AACCGAGATC 3 24 00 

GTGTTGGTCG TAGTGTCATG CCAAATGGAA CGCCGGACGT AGTCATATTT CCTGAAGCAA 3 2 460 

AACCAGGTGC GGGCGTGACA AACAGATCTG CGTCTCCGGT CTCGCCGCTT AGATCGCTCT 3 2 520 

GTGTAGTAGT TGTAGTATAT CCACTCTCTC AAAGCATCCA GGCGCCCCCT GGCTTCGGGT 3 2 580 
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TCTATGTAAA CTCCT^Pb CGCCGCTGCC CTGATAACAT CCACCACCGC AGAATAAGCC 3 2 640 
ACACCCAGCC AACCTACACA TTCGTTCTGC GAGTCACACA CGGGAGGAGC GGGAAGAGCT 32700 
GGA.AGAACCA TGTTTTTTTT TTTATTCCAA AAGATTATCC AAAACCTCAA AATGAAGATC 32760 
TATTAAGTGA ACGCGCTCCC CTCCGGTGGC GTGGTCAAAC TCTACAGCCA AAGAACAGAT 32820 
AATGGCATTT GTAAGATGTT GCACAATGGC TTCCAAAAGG CAAACGGCCC TCACGTCCAA 32880 
GTGGACGTAA AGGCTAAACC CTTCAGGGTG AATCTCCTCT ATAAACATTC CAGCACCTTC 3 2 940 

AACCATGCCC AAATAATTCT CATCTCGCCA CCTTCTCAAT ATATCTCTAA GCAAATCCCG 33000 

AATATTAAGT CCGGCCATTG TAAAAATCTG CTCCAGAGCG CCCTCCACCT TCAGCCTCAA 33060 

GCAGCGAATC ATGATTGCAA AAATTCAGGT TCCTCACAGA CCTGTATAAG ATTCAAAAGC 3 3120 

GGAACATTAA CAAAAATACC GCGATCCCGT AGGTCCCTTC GCAGGGCCAG CTGAACATAA 33180 

TCGTGCAGGT CTGCACGGAC CAGCGCGGCC ACTTCCCCGC CAGGAACCTT GACAAAAGAA 33240 

CCCACACTGA TTATGACACG CATACTCGGA GCTATGCTAA CCAGCGTAGC CCCGATGTAA 33300 

GCTTTGTTGC ATGGGCGGCG ATATAAAATG CAAGGTGCTG CTCAAAAAAT CAGGCAAAGC 3 3 360 

CTCGCGCAAA AAAGAAAGCA CATCGTAGTC ATGCTCATGC AGATAAAGGC AGGTAAGCTC 33420 

CGGAACCACC ACAGAAAAAG ACACCATTTT TCTCTCAAAC ATGTCTGCGG GTTTCTGCAT 33480 

AAACACAAAA TAAAATAACA AAAAAACATT TAAACATTAG AAGCCTGTCT TACAACAGGA 3 3540 

AAAACAACCC TTATAAGCAT AAGACGGACT ACGGCCATGC CGGCGTGACC GTAAAAAAAC 33600 

TGGTCACCGT GATTAAAAAG CACCACCGAC AGCTCCTCGG TCATGTCCGG AGTCATAATG 33660 

TAAGACTCGG TAAACACATC AGGTTGATTC ATCGGTCAGT GCTAAAAAGC GACCGAAATA 33720 

GCCCGGGGGA ATACATACCC GCAGGCGTAG AGACAACATT ACAGCCCCCA TAGGAGGTAT 3 3 780 

AACAAAATTA ATAGGAGAGA AAAACACATA AACACCTGAA AAACCCTCCT GCCTAGGCAA 33840 

AATAGCACCC TCCCGCTCCA GAACAACATA CAGCGCTTCA CAGCGGCAGC CTAACAGTCA 3 3 900 

GCCTTACCAG TAAAAAAGAA AACCTATTAA AAAAACACCA CTCGACACGG CACCAGCTCA 3 3 960 
ATCAGTCACA GTGTAAAAAA GGGCCAAGTG CAGAGCGAGT ATATATAGGA CTAAAAAATG 
ACGTAACGGT TAAAGTCCAC AAAAAACACC CAGAAAACCG CACGCGAACC TACGCCCAGA 
AACGAAAGCC AAAAAACCCA CAACTTCCTC AAATCGTCAC TTCCGTTTTC CCACGTTACG 
TAACTTCCCA TTTTAAGAAA ACTACAATTC CCAACACATA CAAGTTACTC CGCCCTAAAA 
CCTACGTCAC CCGCCCCGTT CCCACGCCCC GCGCCACGTC ACAAACTCCA- CCCCCTCATT 
ATCATATTGG CTTCAATCCA AAATAAGGTA TATTATTGAT GAT 
(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 380 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "DNA" 

txi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 
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CCAGGCTTTA 


CACTTT^WC 


TTCCGGCTCG 


TATGTTGTGT 


ggaattgWs 


GCGGATAACA 


60 


ATTTCACACA 


GGAAACAGCT 


ATGACCATGA 


TTACGCCAAG 


CGCGCAATTA 


ACCCTCACTA 


120 


AAGGGAACAA 


AAGCTGGGTA 


CCGGGCCCCC 


CCTCGAGGTC 


GACGGTATCG 


ATAAGCTTAC 


180 


GCGTGGCCTA 


GGCGGCCGAA 


TTCCTGCAGC 


CCGGGGGATC 


CACTAGTTCT 


agagcggccg 


240 


CCACCGCGGC 


GCCTTAATTA 


ATACGTAAGC 


TCCAATTCGC 


CCTATAGTGA 


GTCGTATTAC 


300 


GCGCGCTCAC 


TGGCCGTCGT 


TTTACAACGT 


CGTGACTGGG 


AAAACCCTGG 


CGTTACCCAA 


360 


CTTAATCGCC 


TTGCAGCACA 










380 



(2) INFORMATION FOR SEQ ID NO : 6 : 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other nucleic acid 
(A) DESCRIPTION: /desc = "DNA" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 

TGCCGCAGCA CCGGATGCAT C 21 

(2) INFORMATION FOR SEQ ID NO: 7: 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

{ C ) STRANDEDNESS : s ingle 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "DNA" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 

GCGTCCGGAG GCTGCCATGC GGCAGGG 2 7 

(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1481 base pairs 

(B) TYPE: nucleic acid" 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "DNA" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 



GGCTGCCATG 


CGGCAGGGAT 


ACGGCGCTAA 


CGATGCATCT 


CAACAATTGT 


TGTGTAGGTA 


60 


CTCCGCCGCC 


GAGGGACCTG 


AGCGAGTCCG 


CATCGACCGG 


ATCGGAAAAC 


CTCTCGAGAA 


120 


AGGCGTCTAA 


CCAGTCACAG 


TCGCAAGGTA 


GGCTGAGCAC 


CGTGGCGGGC 


GGCAGCGGGC 


180 


GGCGGTCGGG 


GTTGTTTCTG 


GCGGAGGTGC 


TGCTGATGAT 


GTAATTAAAG 


TAGGCGGTCT 


240 


TGAGACGGCG 


GATGGTCGAC 


AGAAGCACCA 


TGTCCTTGGG 


TCCGGCCTGC 


TGAATGCGCA 


300 


GGCGGTCGGC 


CATGCCCCAG 


GCTTCGTTTT 


GACATCGGCG 


CAGGTCTTTG 


TAGTAGTCTT 


360 


GCATGAGCCT 


TTCTACCGGC 


ACTTCTTCTT 


CTCCTTCCTC 


TTGTCCTGCA 


TCTCTTGCAT 


420 
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CTATCGCTGC GGCGGCdSW GAGTTTGGCC GTAGGTGGCG CCCTCTTCCT CCCATGCGTG 4 80 

TGACCCCGAA GCCCCTCATC GGCTGAAGCA GGGCTAGGTC GGCGACAACG CGCTCGGCTA 540 
ATATGGCCTG CTGCACCTGC GTGAGGGTAG ACTGGAAGTC ATCCATGTCC ACAAAGCGGT 
GGTATGCGCC CGTGTTGATG GTGTAAGTGC AGTTGGCCAT AACGGACCAG TTAACGGTCT 
GGTGACCCGG CTGCGAGAGC TCGGTGTACC TGAGACGCGA GTAAGCCCTC GAGTCAAATA 
CGTAGTCGTT GCAAGTCCGC ACCAGGTACT GGTATCCCAC CAAAAAGTGC GGCGGCGGCT 
GGCGGTAGAG GGGCCAGCGT AGGGTGGCCG GGGCTCCGGG GGCGAGATCT TCCAACATAA 
GGCGATGATA TCCGTAGATG TACCTGGACA TCCAGGTGAT GCCGGCGGCG GTGGTGGAGG 
CGCGCGGAAA GTCGCGGACG CGGTTCCAGA TGTTGCGCAG CGGC.AAAAAG TGCTCCATGG 
TCGGGACGCT CTGGCCGGTC AGGCGCGCGC AATCGTTGAC GCTCTACCGT GCAAAAGGAG 
AGCCTGTAAG CGGGCACTCT TCCGTGGTCT GGTGGATAAA TTCGCAAGGG TATCATGGCG 
GACGACCGGG GTTCGAGCCC CGTATCCGGC CGTCCGCCGT GATCCATGCG GTTACCGCCC 
GCGTGTCGAA CCCAGGTGTG CGACGTCAGA CAACGGGGGA GTGCTCCTTT TGGCTTCCTT 
CCAGGCGCGG CGGCTGCTGC GCTAGCTTTT TTGGCCACTG GCCGCGCGCA GCGTAAGCGG 
TTAGGCTGGA AAGCGAAAGC ATTAAGTGGC TCGCTCCCTG TAGCCGGAGG GTTATTTTCC 
AAGGGTTGAG TCGCGGGACC CCCGGTTCGA GTCTCGGACC GGCCGGACTG CGGCGAACGG 
GGGTTTGCCT CCCCGTCATG CAAGACCCCG CTTGCAAATT CCTCCGGAAA CAGGGACGAG 
CCCCTTTTTT GCTTTTCCCA GATGCATCCG GTGCTGCGGC A 
(2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3364 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ix> MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "DNA" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 

GAATTCCCCA TCCTGGTCTA TAGAGAGAGT TCCAGAACAG CCAGGGCTAC AGATAAACCC 6 0 

ATCTGGAAAA ACAAAGTTGA ATGACCCAAG AGGGGTTCTC AGAGGGTGGC GTGTGCTCCC 12 0 

TGGCAAGCCT ATGACATGGC CGGGGCCTGC CTCTCTCTGC CTCTGACCCT CAGTGGCTCC 180 

CATGAACTCC TTGCCCAATG GCATCTTTTT CCTGCGCTCC TTGGGTTATT CCAGTCTCCC 24 0 

CTCAGCATTC CTTCCTCAGG GCCTCGCTCT TCTCTCTGCT CCCTCCTTGC ACAGCTGGCT 3 00 

CTGTCCACCT CAGATGTCAC AGTGCTCTCT CAGAGGAGGA AGGCACCATG TACCCTCTGT 360 

TTCCCAGGTA AGGGTTCAAT TTTTAAAAAT GGTTTTTTGT TTGTTTGTTT GTTTGTTTGT 42 0 

TTGTTTGTTT TTCAAGACAG GGCTCCTCTG TGTAGTCCTA ACTGTCTTGA AACTCCCTCT 480 

GTAGACCAGG TCGACCTCGA ACTCTTGAAA CCTGCCACGG ACCACCCAGT CAGGTATGGA 54 0 

GGTCCCTGGA ATGAGCGTCC TCGAAGCTAG GTGGGTAAGG GTTCGGCGGT GACAAACAGA 6 00 

AACAAACACA GAGGCAGTTT GAATCTGAGT GTATTTTGCA GCTCTCAAGC AGGGGATTTT 66 0 
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600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1481 
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AAAAAAAACC 


AAACATTACA 


tctcttagtK 


ACTATATCCA 


720 


ATGAAACAAT 


CACAGATACC 


AACCAAAACC 


ATTGGGCAGA 


GTAAAGCACA 


AAAATCATCC 


780 


AAGCATTACA 


ACTCTGAAAC 


CATGTATTCA 


GTGAATCACA 


aacagaacag 


GTAACATCAT 


840 


TATTAATATA 


AATCACCAAA 


ATATAACAAT 


TCTAAAAGGA 


TGTATCCAGT 


GGGGGCTGTC 


900 


GTCCAAGGCT 


AGTGGCAGAT 


TTCCAGGAGC 


AGGTTAGTAA 


ATCTTAACCA 


CTGAACTAAC 


960 


TCTCCAGCCC 


CATGGTCAAT 


TATTATTTAG 


CATCTAGTGC 


CTAATTTTTT 


TTTATAAATC 


1020 


TTCACTATGT 


AATTTAAAAC 


TATTTTAATT 


CTTCCTAATT 


AAGGCTTTCT 


TTACCATATA 


1080 


CCAAAATTCA 


CCTCCAATGA 


CACACGCGTA 


GCCATATGAA 


ATTTTATTGT 


TGGGAAAATT 


1140 


TGTACCTATC 


ATAATAGTTT 


TGTAAATGAT 


TTAAAAAGCA 


AAGTGTTAGC 


CGGGCGTGGT 


1200 


GGCACACGCC 


TTTAATCCCT 


GCACTCGGGA 


GGCAGGGGCA 


GGAGGATTTC 


TGAGTTTGAG 


1260 


GCCAGCCTGG 


TCTACAGAGT 


GAGTTCCAGG 


ACAGCCAGGG 


CTACACAGAG 


AAACCCTGTC 


1320 


TCGAACCCCC 


CACCCCCCAA 


AAAAAGCAAA 


GTGTTGGTTT 


CCTTGGGGAT 


AAAGTCATGT 


1380 


TAGTGGCCCA 


TCTCTAGGCC 


CATCTCACCC 


ATTATTCTCG 


CTTAAGATCT 


TGGCCTAGGC 


1440 


TACCAGGAAC 


ATGTAAATAA 


GAAAAGGAAT 


AAGAGAAAAC 


AAAACAGAGA 


GATTGCCATG 


1500 


AGAACTACGG 


CTCAATATTT 


TTTCTCTCCG 


GCGAAGAGTT 


CCACAACCAT 


CTCCAGGAGG 


1560 


CCTCCACGTT 


TTGAGGTCAA 


TGGCCTCAGT 


CTGTGGAACT 


TGTCACACAG 


ATCTTACTGG 


1620 


AGGTGGTGTG 


GCAGAAACCC 


ATTCCTTTTA 


GTGTCTTGGG 


CTAAAAGTAA 


AAGGCCCAGA 


1680 


GGAGGCCTTT 


GCTCATCTGA 


CCATGCTGAC 


AAGGAACACG 


GGTGCCAGGA 


CAGAGGCTGG 


1740 


ACCCCAGGAA 


CACCTTAAAC 


ACTTCTTCCC 


TTCTCCGCCC 


CCTAGAGCAG 


GCTCCCCTCA 


1800 


CCAGCCTGGG 


CAGAAATGGG 


GGAAGATGGA 


GTGAAGCCAT 


ACTGGCTACT 


CCAGAATCAA 


1860 


CAGAGGGAGC 


CGGGGGCAAT 


ACTGGAGAAG 


CTGGTCTCCC 


GGCAGGGGCA 


ATCCTGGCAC 


1920 


CTCCCAGGCA 


GAAGAGGAAA 


CTTCCACAGT 


GCATCTCACT 


TCCATGAATC 


CCCTCCTCGG 


1980 


ACTCTGAGGT 


CCTTGGTCAC 


AGCTGAGGTG 


CAAAAGGCTC 


CTGTCATATT 


GTGTCCTGCT 


2040 


CTGGTC7GCC 


TTCCACAGCT 


TGGGGGCCAC 


CTAGCCCACC 


TCTCCCTAGG 


GATGAGAGCA 


2100 


GCCACTACGG 


GTCTAGGCTG 


CCCATGTAAG 


GAGGCAAGGC 


CTGGGGACAC 


CCGAGATGCC 


2160 


TGGTTATAAT 


TAACCCAGAC 


ATGTGGCTGC 


CCCCCCCCCC 


CCAACACCTG 


CTGCCTGAGC 


2220 


CTCACCCCCA 


CCCCGGTGCC 


TGGGTCTTAG 


GCTCTGTACA 


CCATGGAGGA 


GAAGCTCGCT 


2280 


CTAAAAATAA 


CCCTGTCCCT 


GGTGGATCCA 


GGGTGAGGGG 


CAGGCTGAGG 


GCGGCCACTT 


2340 


CCCTCAGCCG 


CAGGTTTGTT 


TTCCCAAGAA 


TGGTTTTTCT 


GCTTCTGTAG 


CTTTTCCTGT 


2400 


CAATTCTGCC 


ATGGTGGAGC 


AGCCTGCACT 


GGGCTTCTGG 


GAGAAACCAA 


ACCGGGTTCT 


2460 


AACCTTTCAG 


CTACAGTTAT 


TGCCTTTCCT 


GTAGATGGGC 


GACTACAGCC 


CCACCCCCAC 


2520 


CCCCGTCTCC 


TGTATCCTTC 


CTGGGCCTGG 


GGATCCTAGG 


CTTTCACTGG 


AAATTTCCCC 


2580 


CCAGGTGCTG 


TAGGCTAGAG 


TCACGGCTCC 


CAAGAACAGT 


GCTTGCCTGG 


CATGCATGGT 


2640 


TCTGAACCTC 


CAACTGCAAA 


AAATGACACA 


TACCTTGACC 


CTTGGAAGGC 


TGAGGCAGGG 


2700 


GGATTGCCAT 


GAGTGCAAAG 


CCAGACTGGG 


TGGCATAGTT 


AGACCCTGTC 


TCAAAAAACC 


2760 
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AAAAACAATT 


AAATAAC^WT 


AGTCAGGCAA 


GTAATCCTAC 


TCGGGAGACT 


GAGGCAGAGG 


2820 


GATTGTTACA 


TGTCTGAGGC 


CAGCCTGGAC 


TACATAGGGT 


TTCAGGCTAG 


CCCTGTCTAC 


2880 


AGAGTAAGGC 


CCTATTTCAA 


AAACACAAAC 


AAAATGGTTC 


TCCCAGCTGC 


TAATGCTCAC 


2940 


CAGGCAATGA 


AGCCTGGTGA 


GCATTAGCAA 


TGAAGGCAAT 


GAAGGAGGGT 


GCTGGCTACA 


3000 


ATCAAGGCTG 


TGGGGGACTG 


AGGGCAGGCT 


GTAACAGGCT 


TGGGGGCCAG 


GGCTTATACG 


3060 


TGCCTGGGAC 


TCCCAAAGTA 


TTACTGTTCC 


ATGTTCCCGG 


CGAAGGGCCA 


GCTGTCCCCC 


3120 


GCCAGCTAGA 


CTCAGCACTT 


AGTTTAGGAA 


CCAGTGAGCA 


AGTCAGCCCT 


TGGGGCAGCC 


3180 


CATACAAGGC 


CATGGGGCTG 


GGCAAGCTGC 


ACGCCTGGGT 


CCGGGGTGGG 


CACGGTGCCC 


3240 


GGGCAACGAG 


CTGAAAGCTC 


ATCTGCTCTC 


AGGGGCCCCT 


CCCTGGGGAC 


AGCCCCTCCT 


3300 


GGCTAGTCAC 


ACCCTGTAGG 


CTCCTCTATA 


TAACCCAGGG 


GCACAGGGGC 


TGCCCCCGGG 


3360 


TCAC 












3364 



(2) INFORMATION FOR SEQ ID NQrlQ: 

ii) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 174 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "DNA" 

(XI) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
GGTACCACTA CGGGTCTAGG CTGCCCATGT AAGGAGGCAA GGCCTGGGGA CACCCGAGAT 
GCCTGGTTAT AATTAACCCC AACACCTGCT GCCCCCCCCC CCCCAACACC TGCTGCCTGA 
GCCTGAGCGG TTACCCCACC CCGGTGCCTG GGTCTTAGGC TCTGTACACC ATGG 
(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 699 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(li) MOLECULE TYPE: Other nucleic acid 
(A) DESCRIPTION: /desc = "DNA" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

AGATCTTCCT AGAAATTCGC TGTCTGCGAG GGCCGGCTGT TGGGGTGAGT ACTCCCTCTC 

AAAAGCGGGC ATGACTTCTG CGCTAAGATT GTCAGTTTCC AAAAACGAGG AGGATTTGAT 

ATTCACCTGG CCCGCGGTGA TGCCTTTGAG GGTGGCCGCG TCCATCTGGT CAGAAAAGAC 

AATCTTTTTG TTGTCAAGCT TGAGGTGTGG CAGGCTTGAG ATCGATCTGG CCATACACTT 

GAGTGACAAT GACATCCACT TTGCCTTTCT CTCCACAGGT GTCCACTCCC AGGTCCAACC 

GCGGATCTCC CGGGACCATG CCCAAGAAGA AGAGGAAGGT GTCCAATTTA CTGACCGTAC 

ACCAAAATTT GCCTGCATTA CCGGTCGATG CAACGAGTGA TGAGGTTCGC AAGAACCTGA 

TGGACATGTT CAGGGATCGC CAGGCGTTTT CTGAGCATAC CTGGAAAATG CTTCTGTCCG 



60 
120 
174 



60 
120 
180 
240 
300 
360 
420 
480 
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TTTGCCGGTC GTGGGCGGCA TGGTGCAAGT TGAATAACCG GAAATGGTTT CCCGCAGAAC 54 0 

CTGAAGATGT TCGCGATTAT CTTCTATATC TTCAGGCGCG CGGTCTGGCA GTAAAAACTA 6 00 

TCCAGCAACA TTTGGGCCAG CTAAACATGC TTCATCGTCG GTCCGGGCTG CCACGACCAA 66 0 

GTGACAGCAA TGCTGTTTCA CTGGTTATGC GGCGGATCC 699 
(2) INFORMATION FOR SEQ ID NO:i2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 8 base pairs 

(B) TYPE: nucleic acrd 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "DNA" 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12; 

GAAGATCTAT AACTTCGTAT AATGTATGCT ATACGAAGTT ATTACCGAAG AAATGGCTCG 6 0 

AGATCTTC 6 3 

(2) INFORMATION FOR SEQ ID NO: 13: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 68 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "DNA" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

GAAGATCTCG AGCCATTTCT TCGGTAATAA CTTCGTATAG CATACATTAT ACGAAGTTAT 6 0 

AGATCTTC 6 g 

(2) INFORMATION FOR SEQ ID NO : 14 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other nucleic acid 
(A) DESCRIPTION: /desc = "DNA" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

CCACATGTAT AACTTCGTAT AGCATACATT ATACGAAGTT ATACATGTGG 5 0 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "DNA" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 5 : 

CCACATGTAT AACTTCGTAT AATGTATGCT ATACGAAGTT ATACATGTGG 5 0 
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CLAIMS 



15 



A recombinant plasmid. comprising in operable combi 
a) a plasmid backbone, comprising an origin of repi 



nation: 
ication. an 



antibiotic resistance gene and a eukaryotic promoter element; 

'^^"t ^nd right inverted terminal repeats (ITRs) of 
adenovirus, said ITRs each having a and a 3" end and arranged ,n a tail 
to tail orientation on said plasmid backbone; 

c) the adenovirus packaging sequence, said packagmg sequence 
having a 5- and a 3 - end and linked to one of said ITRs; and 

'0 d) a first gene of interest operably linked to said promoter 

element. 



2. The recombinant plasmid of Claim 1 wherein the total size of said 
plasmid is between 27 and 40 kilobase pairs. 



3. The recombinant plasmid of Claim 1 . wherein said 5' end of said 
packaging sequence is linked to said 3" end of said left ITR. 

4. The recombinant plasmid of Claim 3. wherein said first gene of 
interest is linked to said 3" end of said packaging sequence. 

5. The recombinant plasmid of Claim 4. wherein said first gene of 
interest is the dystrophin cDNA gene. 



20 6 



The recombinam plasmid of Claim 4. further comprising a second 
gene of interest. 



7. The recombinant plasmid of Claim 6. wherein said second gene of 
interest is linked to said 3' end of said right ITR. 

8. The recombinant plasmid of Claim 7. wherein said second gene of 
interest is a reporter gene. 
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9. nlie recombinant plasmid of Claim 8. xvlierein said reporter uene is 
selected from the group consisting of the E. cy;// (i-galaciosidase gene, the human 
placental alkaline phosphatase gene, the green lluroescent protein gene and the 
choramphenicol acetyltransferase gene. 



5 10. 
Claim 1 . 



A mammalian cell line containing the recombinant plasmid of 



1 i. The cell line of Claim 10. wherein said cell line is a 293-derived cell 



hne. 



12- A helper adenovirus comprising i) first and a second loxP sequences. 
10 and ii) the adenovirus packaging sequence, said packaging sequence havmg a 5" 

and a 3" end. 

13. The helper adenovirus of Claim 12. wherein said first /oxP sequence 
is linked to the 5' end of said packaging sequence and said second loxP sequence is 
linked to said 3* end of said packaging sequence. 

14. The helper adenovirus of Claim 12. further comprising at least one 
adenovirus gene coding region. 

15. A mammalian cell line, comprising: 

a) a recombinant plasmid. comprising, in operable combination: 

i) a plasmid backbone, comprising an origin of 

-0 replication, an antibiotic resistance gene and a eukaryotic promoter 

element, 

ii) the left and right inverted terminal repeats (ITRs) of 
adenovirus, said ITRs each having a 5' and a 3' end and arranged in 
a tail to tail orientation on said plasmid backbone. 

iii) the adenovirus packaging sequence, said packaging 
sequence having a 5' and a 3* end and linked to one of said ITRs. 
and 
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'V) a tirsi gene of interest operabiy linked to said 
promoter element; and 

b) a helper adenovirus comprising i) first and a second loxP 
sequences, and i.) the adenovirus packagmg sequence, said packaging 
sequence having a 5" and a 3" end 

16. The cell line of Claim 15. wherein said helper adenovirus further 
comprises at least one adenovirus gene coding region. 

17. The cell line of Claim 15. wherein said first gene of interest is the 
dystrophin cDNA gene 



15 



1 8. A method of producing an adenovirus minichromosome. comprising: 
A) providing a mammalian cell line containing: 

a) a recombinant plasmid. comprising, in operable 
combination, i) a plasmid backbone, comprising an origin of 
replication, an antibiotic resistance gene and a eukarotic promoter 
element, ii) the left and right inverted terminal repeats (ITRs) of 
adenovirus, said ITRs each having a and a 3' end and arranged in 
a tail to tail orientation on said plasmid backbone, iii) the adenovirus 
packaging sequence, said packaging sequence having a 5" and a 3' 
end and linked to one of said ITRs. and iv) a first gene of interest 
operabiy linked to said promoter element: and 

b) a helper adenovirus comprising i) first and a second 
loxP sequences, ii) at least one adenovirus gene coding region, and 
iii) the adenovirus packaging sequence, said packaging sequence 
having a 5" and a 3" end; and 

B) growing said cell line under conditions such that said 
adenovirus gene coding region is e.xpressed and said recombinant plasmid 
directs the production of at least one adenoviral minichromosome. 

1 9. The method of Claim 1 8. wherein said adenovirus minichromosome 
is encapidated. 
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20. ^-frie method of Claim 18. further comprisin;i 3) recoverine said 
encapsidaied adenovirus minichromosome. 

21. The method of Claim 20. further comprising 4) puntlying said 
recovered encapsidated adenovirus minichromosome. 

5 22. The method of Claim 21, further comprisng 5) administering said 

purified encapsidated adenovirus minichromosome to a host. 

23. The method of Claim 22. wherein said host is a mammal. 

24. The method of Claim 23, wherein said mammal is a human. 

25. A recombinant adenovirus comprising the adenovirus E2b region 
10 having a deletion, said adenovirus capable of self-propagaiion in a packaging cell 

line and said E2b region comprising the DNA polymerase gene and the adenovirus 
preterminal protein gene. 

26. The recombinant adenovirus of Claim 25. wherein said deletion is 
within the adenovirus DNA polymerase gene. 

15 27. The recombinant adenovirus of Claim 25. wherein said deletion is 

within the adenovirus preterminal protein gene. 

28. The recombinant adenovirus of Claim 25. wherein said deletion is 
within the adenovirus DNA polymerase and preterminal protein genes. 

29. A mammalian cell line stably and constiiutively expressing the 
20 adenovirus El gene products and the adenovirus DNA polymerase. 

30. The cell line of Claim 29. wherein said cell line comprises a 
recombinant adenovirus comprising a deletion within the E2b region, said 
recombinant adenovirus being capable of self-propagation in said cell line. 
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31. ^le cell line of Claim 30. wherein said deletion withm sa.d E2b 
region comprises a deletion w.thin the adenoviral DNA polymerase ize 



32. The cell of Cla,m 29. wherein the genome of sa.d ceil Ime contains 
a nucleotide sequence encoding adenov.rus DNA polymerase operablv |,nked to a 
heterologous promoter. 



10 



33. The cell line of Claim 32. wherein said cell line ,s selected from the 
group consisting of the B-6. B-9. C-I. C-4. C-7. C-I3. and C-14 cell lines. 

34. The cell line of Claim 29 further constitutively expressing the 
adenovirus preterminal protein gene product. 

35. The cell line of Claim 34. wherein said cell line comprises a 
recombinant adenovirus comprising a deletion within the E2b region, said 
recombinant adenovirus being capable of self-propagation in said cell line. 

36. The cell line of Claim 35. wherem said deletion within said E2b 
region comprises a deletion within the adenoviral preterminal protein gene. 

5 37. The cell line of Claim 36. wherein said deletion within said E2b 

region comprises a deletion within the adenoviral DNA polymerase and preterminal 
protein genes. 

38. . . The cell line of Claim 34. wherein the genome of said cell line 
contains a nucleotide sequence encoding adenovirus preterminal protein operably 

0 linked to a heterologous promoter. 

39. The cell line of Claim 38. wherein said cell line is selected from the 
group cotisisting of the C-1. C-4. C-7. C-13. and C-14 cell lines. 
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40. /\ method of producing infectious recomomant adenovirus particles 
containing an adenoviral genome containing a deletion within the E2b rei^ion. 
comprising 

a) providing: 

5 i) a mammalian cell line stably and consiitutiveiv 

expressing the adenovirus El gene products and the adenovirus DNA 
polymerase: 

ii> a recombinant adenovirus comprising a deletion within 
the E2b region, said recombinant adenovirus being capable of self- 
10 propagation in said cell line; 

b) introducing said recombinant adenovirus into said cell line 
under conditions such that said recombinant adenovirus is propagated to 
form infectious recombinant adenovirus particles; and 

c) recovering said infectious recombinant adenovirus particles. 

^5 41. The method of Claim 40, further comprising d) purifying said 

recovered infectious recombinant adenovirus particles. 

42. The method of Claim 41, further comprising e) administering said 
purified recombinant adenovirus particles to a host. 

43. The method of Claim 40, wherein said mammalian cell line further 
20 constitutively expresses the adenovirus preterminal protein. 

44. A recombinant plasmid capable of replicating in a bacterial host 
comprising adenoviral E2b sequences, said E2b sequences containing a deletion 
within the polymerase gene, said deletion resulting in reduced polymerase activity. 

45. The recombinant plasmid of Claim 44. wherein said deletion 
25 comprises a deletion of nucleotides 8772 to 9385 in SEQ ID NO:4. 

46. The recombinant plasmid of Claim 45. wherein said plasmid has the 
designation pApoL 
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47. W recombinam plasmid of Claim 45. wherein said plasmid has the 
designation pBHGl lApol. 



48. A recombinant plasmid capable of replicating in a bacterial host 
comprising adenoviral E2b sequences, said E2b sequences containing a deletion 
within the preterminal protein gene, said deletion resulting in the inability to 
express functional preterminal protein without disruption of the V.A RNA aenes. 

49. The recombinant plasmid of Claim 48. wherein said deletion within 
said preterminal protein gene comprises a deletion of nucleotides 10.705 to 1 1,134 
in SEQ ID N0:4. 



10 



15 



50. The recombinant plasmid of Claim 49. wherein .said plasmid has the 
designation pApTP. 

51. The recombinant plasmid of Claim 49, wherein said plasmid has the 
designation pBHGl lApTP. 

52. The recombinant plasmid of Claim 48 further comprising a deletion 
within the polymerase gene, said deletion resulting in reduced polymerase activity. 

53. The recombinant plasmid of Claim 52. wherein said deletion within 
said polymerase and said preterminal protein genes comprises a deletion of 
nucleotides 8.773 to 9586 and 1 1.067 to 12.513 in SEQ ID NO:4. 

54. The recombinant plasmid of Claim 53, wherein said plasmid has the 
20 designation pAXBApolApTPVARNA+tl3. 

55. The recombinant plasmid of Claim 53. wherein said plasmid has the 
designation pBHGl 1 ApolApTPVARNA+tl3. 
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